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Preface 


This book is about the theory of continuous-state automated systems whose 
inputs, outputs and internal variables (temperature, speed, tension, etc.) can vary in a 
continuous manner. This is contrary to discrete-state systems whose internal 
variables are often a combination of binary sizes (open/closed, present/absent, etc.). 


The word “linear” requires some explanation. The automatic power control of 
continuous-state systems often happens through actions in relation to the gaps we 
are trying to control. Thus, it is possible to regulate cruise control by acting on the 
acceleration control proportionally to the gap observed in relation to a speed 
instruction. The word “proportional” precisely summons up a linear control law. 


Some processes are actually almost never governed by laws of linear physics. 
The speed of a vehicle, even when constant, is certainly not proportional to the 
position of the accelerator pedal. However, if we consider closed loop control laws, 
the return will correct mistakes when they are related either to external disturbances 
or to gaps between the conception model and the actual product. This means that 
modeling using a linear model is generally sufficient to obtain efficient control laws. 
Limits to the automated systems performances generally come from the restricted 
power of motors, precision of captors and variability of the behavior of the 
processes, more than from their possible non-linearity. 


It is necessary to know the basics of linear automated systems before learning 
about the theory of non-linear systems. That is why linear systems are a fundamental 
theory, and the problems linked to closed-loop control are a big part of it. 


Input-output and the state representations, although closely linked, are explained 
in separate chapters (1 and 2). Discrete-time systems are, for more clarity, explained 
in Chapter 3. Chapter 4 explains the structural properties of linear systems. Chapter 
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5 looks into deterministic and statistical models of signals. Chapter 6 introduces us 
to two fundamental theoretical tools: state stabilization and estimation. These two 
notions are also covered in control-related chapters. Chapter 7 defines the elements 
of modeling and identification. All modern control theories rely on the availability 
of mathematical models of processes to control them. 


Modeling is therefore upstream of the control engineer. However, pedagogically 
it is located downstream because the basic systems theory is needed before it can be 
developed. This same theory also constitutes the beginning of Chapter 8, which is 
about simulation techniques. These techniques form the basis of the control laws 
created by engineers. 


Chapter 9 provides an analysis of the classic invariable techniques while Chapter 
10 summarizes them. Based on the transfer function concept, Chapter 11 addresses 
pole placement control and Chapter 12 internal control. The three following chapters 
cover modern automation based on state representation. They highlight the 
necessary methodological aspects. H) optimization control is explained in Chapter 
13, modal control in Chapter 14 and Hə control in Chapter 15. Chapter 16 covers 
linear time-variant systems. 


Part 1 


System Analysis 


This page intentionally left blank 


Chapter 1 


Transfer Functions and Spectral Models 


1.1. System representation 

A system is an organized set of components, of concepts whose role is to 
perform one or more tasks. The point of view adopted in the characterization of 
systems is to deal only with the input-output relations, with their causes and effects, 
irrespective of the physical nature of the phenomena involved. 

Hence, a system realizes an application of the input signal space, modeling 


magnitudes that affect the behavior of the system, into the space of output signals, 
modeling relevant magnitudes for this behavior. 


<a 


Figure 1.1. System symbolics 


In what follows, we will consider mono-variable, analog or continuous systems 
which will have only one input and one output, modeled by continuous signals. 


Chapter written by Dominique BEAUVOIS and Yves TANGUY. 
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1.2. Signal models 


A continuous-time signal (t € R) is represented a priori through a function x(A) 
defined on a bounded interval if its observation is necessarily of finite duration. 


When signal mathematical models are built, the intention is to artificially extend 
this observation to an infinite duration, to introduce discontinuities or to generate 
Dirac impulses, as a derivative of a step function. The most general model of a 


continuous-time signal is thus a distribution that generalizes to some extent the 
concept of a digital function. 


1.2.1. Unit-step function or Heaviside step function U(t) 


This signal is constant, equal to 1 for the positive evolution variable and equal to 
0 for the negative evolution variable. 


UW 


Figure 1.2. Unit-step function 


This signal constitutes a simplified model for the operation of a device with a 
very low start-up time and very high running time. 


1.2.2. Impulse 
Physicists began considering shorter and more intense phenomena. For example, 


an electric loading Mu can be associated with a mass M evenly distributed 
according to an axis. 
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What density should be associated with a punctual mass concentrated in 0? This 
density can be considered as the bound (simple convergence) of densities Mu, (0) 


verifying: 


n 1 1 
o)= — -—-<O<— + 
Un ( ) 2 à n M [one = M 


4,(6)=0 elsewhere 


This bound is characterized, by the physicist, by a “function” 5(0) as follows: 


5(o)=0 o #0 
+00 
with Í 5(a)do =1 
ô (0) =+% 
However, this definition does not make any sense; no integral convergence 
theorem is applicable. 
Nevertheless, if we introduce an auxiliary function @(6) continuous in 0, we 
will obtain the mean formula: 


1 


+ y 1 
lim Í Wo) u, (o)do = (79) because -+< y < 
n—tood— V n n 


Hence, we get a functional definition, indirect of symbol 6: Sassociates with any 
continuous function at the origin its origin value. Thus, it will be written in all cases: 


9(0) = (6.9) = [ ~ 90) 5(o)do 


ô is called a Dirac impulse and it represents the most popular distribution. This 
impulse 6 is also written d(t). 

For a time lag ¢,, we will use the notations d(t-¢,) or 6u,) 0) ; the impulse is 
graphically “represented” by an arrow placed in ¢ = ¢,, with a height proportional to 


the impulse weight. 
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In general, the Dirac impulse is a very simplified model of any impulse 
phenomenon centered in t = tọ, with a shorter period than the time range of the 
systems in question and with an area S. 








x(t) t x(t) 
: Area S SÔ(t — to) 
| => 
D t me t 
T 


Figure 1.3. Modeling of a short phenomenon 
We notice that in the model based on Dirac impulse, the “microscopic” look of 
the real signal disappears and only the information regarding the area is preserved. 
Finally, we can imagine that the impulse models the derivative of a unit-step 
function. To be sure of this, let us consider the step function as the model of the real 


signal u(t) represented in Figure 1.4, of derivative u(t). Based on what has been 
previously proposed, it is clear that lim w/,(t) = 0(1). 
T0 


u(t) ugt) 


1 
T 





t t 
T 
2 


Nla 
Nla 


Figure 1.4. Derivative of a step function 
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1.2.3. Sine-wave signal 


x(t)= Acos(27f,t+@p) or x(t)= Ae! (27f.1+9) for its complex representation. 
fs designates the frequency expressed in Hz, w, = 27 f, designates the impulse 
expressed in rad/s and @ the phase expressed in rad. 


A real value sine-wave signal is entirely characterized by fọ (0< f, <+), by 
A (t= to), by @ (-7 < <+). On the other hand, a complex value sine-wave 
signal is characterized by a frequency f, with -œ < fọ < +0. 


1.3. Characteristics of continuous systems 


The input-output behavior of a system may be characterized by different 
relations with various degrees of complexity. In this work, we will deal only with 
linear systems that obey the physical principle of superposition and that we can 
define as follows: a system is linear if to any combination of input constant 
coefficients ajx; corresponds the same output linear combination, 


Dai = È a;G(x;). 


Obviously, in practice, no system is rigorously linear. In order to simplify the 
models, we often perform linearization around a point called an operating point of 
the system. 


A system has an instantaneous response if, irrespective of input x, output y 
depends only on the input value at the instant considered. It is called dynamic if its 
response at a given instant depends on input values at other instants. 


A system is called causal system if its response at a given instant depends only 
on input values at previous instants (possibly present). This characteristic of 
causality seems natural for real systems (the effect does not precede the cause), but, 
however, we have to consider the existence of systems which are not strictly causal 
in the case of delayed time processing (playback of a CD) or when the evolution 
variable is not time (image processing). 


The pure delay system t>0 characterized by y(t) = x(t —t) is a dynamic 
system. 
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1.4. Modeling of linear time-invariant systems 


We will call LTI such a system. The aim of this section is to show that the input- 
output relation in an LTI is modeled by a convolution operation. 


1.4.1. Temporal model, convolution, impulse response and unit-step response 


We will note by h,(t) the response of the real impulse system represented in 
Figure 1.5. 


x(t) h,(t) 
T 
t > 
T 


Figure 1.5. Response to a basic impulse 





Let us approach any input x(t) by a series of joint impulses of width 7 and 
amplitude x(kT). 


X(t) 


+ 


T 


Figure 1.6. Step approximation 


Transfer Functions and Spectral Models 9 


By applying the linearity and invariance hypotheses of the system, we can 
approximate the output at an instant £ by the following amount, corresponding to 
the recombination of responses to different impulses that vary in time: 


y= ¥ralke)h_(¢—ke) 


— co 


In order to obtain the output at instant t, we will make T tend toward 0 so that 
our input approximation tends toward x. Hence: 


lim x,(t) = d(t) and lim A,(t) = A(t) 


tT 0 T—0 


where A(t), the response of the system to the Dirac impulse, is a characteristic of 
the system’s behavior and is called an impulse response. 


If we suppose that the system preserves the continuity of the input, i.e. for any 


convergent sequence x, (f) we have G ( lim x, w) = lim G(x„(t)) , we obtain: 
n—>co noo 


+00 
y(t)= | xO -oao 
or: 
+00 
y(t) = | h(o)x(t- odo through o =t-@ 
which defines the convolution integral of functions x and h, noted by the asterisk: 


y(t) = x* h(t) = h* x(t) 


1.4.2. Causality 


When the system is causal, the output at instant ¢ depends only on the previous 
inputs and consequently function h(t) is identically zero for t <0. The impulse 
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response, which considers the past in order to provide the present, is a causal 
function and the input-output relation has the following form: 


+co t 
y(t) = Í MOxt-0d0= Í x(A)h(t-0)dO 
0 —co 


The output of a causal time-invariant linear system can be interpreted as a 
weighted mean of all the past inputs having excited it, a weighting characteristic for 
the system considered. 


1.4.3. Unit-step response 


The unit-step response of a system is its response i(t) to a unit-step excitation. 
The use of the convolution relation leads us to conclude that the unit-step response is 
the integral of the impulse response: 


t 
i(t) = [mao 
0 


This response is generally characterized by: 


— the rise time 1¢,,, which is the time that separates the passage of the unit-step 
response from 10% to 90% of the final value; 


— the response time, , also called establishment time, is the period at the end of 
which the response remains in the interval of the final value + @%. A current value 
of @ is 5%. This time also corresponds to the period at the end of which the impulse 
response remains in the interval + @%; it characterizes the transient behavior of the 
system output when we start applying an excitation and it also reminds that a system 
has several inputs which have been applied before a given instant; 

Ymax — Y(ee) 


— the possible overflow defined as © 
y co 


expressed in percentage. 


1.4.4. Stability 


1.4.4.1. Definition 


The concept of stability is delicate to introduce since its definition is linked to 
the structures of the models studied. Intuitively, two ideas are outlined. 
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A system is labeled as stable around a point of balance if, after having been 
subjected to a low interference around that point, it does not move too far away from 
it. We talk of asymptotic stability if the system returns to the point of balance and of 
stability, in the broad sense of the word, if the system remains some place near that 
point. This concept, intrinsic to the system, which is illustrated in Figure 1.7 by a 
ball positioned on various surfaces, requires, in order to be used, a representation by 
equations of state. 


Asymptotic stability Stability in the broad sense Unstable 


Figure 1.7. Concepts of stability 


Another point of view can be adopted where the stability of a system can be 
defined simply in terms of an input-output criterion; a system will be called stable if 
its response to any bounded input is limited: we talk of L(imited) I(nput) L(imited) 
R(esponse) stability. 


1.4.4.2. Necessary and sufficient condition of stability 


An LTI is BIBO (bounded input, bounded output) if and only if its impulse 
response is positively integrable, i.e. if: 


too 
[né |a6 < +22 


The sufficient condition is immediate if the impulse response is positively 
integrable and applying a bounded input to the system, Yt xe) | < M, leads to a 
bounded output because: 


+00 +0 
w bols | ke-ol Koja < MT Kodos ++ 


Let us justify the necessary condition: the system has a bounded output in 
response to any bounded excitation, then its impulse response is positively 
integrable. 
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To do this, let us demonstrate the opposite proposition: if the impulse response 
of the system is not absolutely integrable: 


+T 
VK,3T [ |~(@)|dO> K 


there is a bounded input that makes the output diverge. 


It is sufficient to choose input x such that: 


x(T —-0)=sgn(h(0)) for 0 <T and x(T-0)=0 for 0>T 


T 
then y(T)= Í h(@) sgn(h(@))d@> K WK which means that y diverges. 


1.4.5. Transfer function 


Any LTI is modeled by a convolution operation, an operation that can be 
considered in the largest sense, i.e. the distribution sense. We know that if we 
transform this product through the proper transform (see section 1.4.1), we obtain a 
simple product. 





x LTI y X LTI Y 
y(_) = h(_)* x(_) Y(_)=H(_)xXX(_) 
Time domain (convolution) Spectral range (product) 
Y(_) 
H(_)=——= 
X(_) 


This formally defined transform ratio is the transform of the impulse response 
and is called a transfer function of LTI. 


The use of transfer functions has a considerable practical interest in the study of 
system association as shown in the examples below. 
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1.4.5.1. Cascading (or serialization) of systems 


Let us consider the association of Figure 1.8. 


Figure 1.8. Serial association 


Hence y3(_ ) = A3 (_)* (hə (_)*(hı(_)*xı(_))). This leads, in general, to a 
rather complicated expression. 








In terms of transfer function, we obtain: 


H(_)=H(_)xH(_)xH3(_) 


i.e. the simple product of three basic transfer functions. The interest in this 
characteristic is that any processing or transmission chain basically consists of an 
association of “basic blocks”. 


1.4.5.2. Other examples of system associations 


+ 


+ 





Figure 1.9. Parallel association 


a )= RES +) 
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Figure 1.10. Loop structure 


Fes Ay(_) 


ee E(_) 1+4\(_)xH(_) 





The term 1+H,(_ )xH,(_ ) corresponds to the return difference, which is 
defined by 1 — (product of loop transfers). The loop transfers, introduced in the 


structure considered here, are the sign minus the comparator, the transfers H, and 
Hy. 
1.4.5.3. Calculation of transfer functions of causal LTIs 


In this section, we suppose the existence of impulse response transforms while 
keeping in mind the convergence conditions. 


Using the Fourier transform, we obtain the frequency response H(f): 


+00 
A(f)= Í h(0) e 2200 = |HP) |e) 
0 


where |H (f )| is the module or gain, o( f ) is the phase or phase difference of the 
frequency response. 


Through the Laplace transform, we obtain the transfer function of the system 
H(p), which is often referred to as isomorphic transfer function: 


+00 


H(p)= | nO” 
0 
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The notations used present an ambiguity (same H) that should not affect the 
informed reader: when the impulse response is positively integrable, which 
corresponds to a stability hypothesis of the system considered, we know that the 
Laplace transform converges on the imaginary axis and that it is mistaken with 
Fourier transform through p = 27 j f . Hence, the improper notation (same A): 


Hp). =H) 


p=2n jf 


We recall that the transfer functions have been formally defined here and the 
convergence conditions have not been formulated. For the LTIs, which are system 
models that can be physically realized, the impulse responses are functions whose 
Laplace transform has always a sense within a domain of the complex plane to 
define. 


On the other hand, the frequency responses, which are defined by the Fourier 
transform of the impulse response, even considered in the distribution sense, do not 
always exist. The stability hypothesis ensures the simultaneous existence of two 


transforms. 


EXAMPLE 1.1— it is easily verified whether an integrator has as an impulse 
response the Heaviside step function A(t) = u(t) and hence: 


a 1 = — PEER 
H(p)= = Alf)= ; S(f)+ F Pf ; 


E dense il 
where Pf 5 designates the pseudo-function distribution — . 


An LTI with localized constants is represented through a differential equation 
with constant coefficients with m <n: 


boy(t)+ +8,» (0) = agx(t)+ et apx (6) 
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By supposing that x(t) and y(t) are continuous functions defined from — to 
+ , continuously differentiable of order m and n, by a two-sided Laplace transform 
we obtain the transfer function H(p): 


(bo +bypt...+b,p")Y(p)= (dg +aıp+...amp”)X(p) 


ag+tap+...app” 
bo +bip +... +b p” 





H(p)= 


Such a transfer function is called rational in p. The coefficients of the numerator 
and denominator polynomials are real due to their physical importance in the initial 
differential equation. Hence, the numerator roots, called zeros, and the denominator 
roots, called transfer function poles, are conjugated real or complex numbers. 


If x(t) and y(t) are causal functions, the Laplace transform of the differential 
equation entails terms based on initial input values x(0), x’(0), xm) (0) and 
output values y(0), y‘(0), yard (0); the concept of state will make it possible to 
overcome this dependence. 


1.4.6. Causality, stability and transfer function 


We have seen that the necessary and sufficient condition of stability of an SLI is 


for its impulse response to be absolutely integrable: ime \n(a jaa < too, 


The consequence of the hypothesis of causality modifies this condition because 
we thus integrate from 0 to+c. 


On the other hand, if we seek a necessary and sufficient condition of stability for 
the expression of transfer functions, the hypothesis of causality is determining. 


Since the impulse response n(6) is a causal function, the transfer function H(p) 
is holomorphic (defined, continuous, derivable with respect to the complex number 
p) in a right half-plane defined by Re(p) > ©, . The absolute integrability of h(@) 
entails the convergence of H(p) on the imaginary axis. 


A CNS of EBRB stability of a causal LTI is that its transfer function is 
holomorphic in the right half-plane defined by Re(p)20. 
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where N(p) and D(p) are polynomials, it is the same as saying that all the transfer 
function poles are negative real parts, i.e. placed in the left half-plane. 


We note that in this particular case, the impulse response of the system is a 
function that tends infinitely toward 0. 


1.4.7. Frequency response and harmonic analysis 


1.4.7.1. Harmonic analysis 


Let us consider a stable LTI whose impulse response A(O) is canceled after a 
period of time fp . For the models of physical systems, this period of time fp is in 
fact rejected infinitely; however, for reasons of clarity, let us suppose fp as finite, 
corresponding to the response time to 1% of the system. 


When this system is subject to a harmonic excitation x(t) = Ae Ho! from 
t = 0, we obtain: 


t 


= | 


(h(a) Aer a = Ae (0) dena 


For ¢ > tg , the impulse response being zero, we have: 
t wp +00 A 5 
f hO)? 7 d0 =H (fo) = f hO) e7 d0 =|H( fy iP 


and hence for t > tp , we obtain y(t) = AH(F,)e27 it = AJH ef 7P | 


This means that the system, excited by a sine-wave signal, has an output that 
tends, after a transient state, toward a sine-wave signal of same frequency. This 
signal, which is a characteristic of a steady (or permanent) state, is modified in 
amplitude by a multiplicative term equal to |H ( fl and with a phase difference of 


Po). 
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x()= A, sinQafot) LTI y@)= A, sinQafor + D) 


A 
ra = |H(fo)| module or gain ® = arg H(f,) phase 


x 


We note that H(f) is nothing else but the Fourier transform of the impulse 
response, the frequency response of the system considered. 
1.4.7.2. Existence conditions of a frequency response 

The frequency response is the Fourier transform of the impulse response. It can 
be defined in the distribution sense for the divergent responses in 1” but not for 


exponentially divergent responses (el!) . However, we shall note that this response 


is always defined under the hypothesis of stability; in this case and only in this case, 
we pass from transfer functions with complex variables to the frequency response by 
determining that p=2xjf. 


EXAMPLE 1.2.- let h(t) = u(t) be the integrator system: 


1 
H(p) = 7 
1 1 1 1 
aE ares clr and not jf because the system is not EBRB 


stable. 


H(p)=TL(u(t)) is defined according to the functions in the half-plane 
Re(p) > 0, whereas H(f) = TF(u(r)) is defined in the distribution sense. 


Unstable filter of first order: A(t)=e' 120 


1 ; 
lp) > defined for Re(p)>1, H(f) is not defined, even in the 
distribution sense. 
Hence, even if the system is unstable, we can always consider the complex 


number obtained by formally replacing p by 27jf in the expression of the transfer 
function in p. The result obtained is not identified with the frequency response but 
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may be taken as a harmonic analysis, averaging certain precautions as indicated in 
the example in Figure 1.11. 


Let us consider the unstable causal system of transfer function 





T inserted 


into the loop represented in Figure 1.11. 


x(t) K u(t) shal yt) 


Figure 1.11. Unstable system inserted into a loop 











The transfer function of the system is 





2 
T The looped system is stable and 


hence we can begin its harmonic analysis by placing an input sine-wave signal 
x(t) = A, sin(27% fot). During the stationary regime, y(t) and u(t) are equally 


sinusoidal, hence: 





























A 2 2 
y(t) = A, sin fot +P ,) with = ————_| and ® ,, = arg} —— 
? oray A, |2ajfo +1 y 2rifo +1 
A, l2C#fo -1 2(2%jfo -1 
u(t) = A, sinQafot+®,) with £ = Bafo IES = arg Co ) 
A, 2#%f0 +1 2#%f0 +1 
Hence: 
A 1 
i 
— =|_—__|=H | 
A, |2Zifo -1 Pl permit 
®,-®, = l = argH(p)| 
ye NE ro) CP Pré 
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Table 1.1 sums up the features of a system’s transfer function, the existence 
conditions of its frequency response and the possibility of performing a harmonic 
analysis based on the behavior of its impulse response. 


1.4.7.3. Diagrams 





h(t) HG) HQrjf)= H(jo) 


H(p) has its poles on the left of 
the imaginary axis. 





oo TF exists 
[ |n(@)| do < +00 Possible direct analysis 
EBRB stability Holomorphy TESH 7 





H(p) has a pole of order n at the 
origin. 

Directly impossible 

harmonic analysis 


t> +o |A| 1" | monie 
impossible directly except 


: 


HOlOHOEDDY, for a simple pole at the 
origin 
H(p) has poles on the right of 
the imaginary axis. 

# Possible analysis if the 
t > +% |ace)| xe system is introduced in a 
k>0 + Paco, stable looping and 

H(jw)= H(p)| 


p=jo 





H(p) has poles on the imaginary 
axis. 


t—> +00 |NO) ~ el 


Holomorphy 


à 

















Table 1.1. Unit-step responses, transfer functions and 
existence conditions of the frequency response 
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Frequency responses are generally characterized according to impulse 
w=2njf and data |H ( jæ) and ®(jæ) grouped together as diagrams. The 
following are distinguished: 


— Nyquist diagram where the system of coordinates adopts in abscissa the real 
part, and in ordinate the imaginary part H( D ; 

— Black diagram where the system of coordinates adopts in ordinate the module 
expressed in decibels, like: 


20 logioH (p) lea and in abscissa arg H( p)| expressed in degree; 


p=jo 
— Bode diagram which consists of two graphs, the former representing the 

module expressed in decibels based on logjg(@) and the latter representing the 

phase according to log,g(@). Given the biunivocal nature of the logarithm function 

and in order to facilitate the interpretation of the diagram, the axes of the abscissas 

are graduated in ©. 

1.5. Main models 


1.5.1. Integrator 


This system has for impulse response A(t) = KU(t) and for transfer function in p: 


HET Re (p)> 0 


The unit-step response is a slope ramp K : i(t) = KtU(t). 


The frequency response, which is the Fourier transform of the impulse response, 
is defined only in the distribution sense: 


1 1 1 
MESET F 





The evolution of H( Pa according to © leads to the diagrams in Figure 1.12. 
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Figure 1.12. Bode diagram 


The module is characterized by a straight line of slope (—1), -6 dB per octave 
(factor 2 between 2 impulses) or —20 dB per decade (factor 10 between two 
impulses), that crosses the axis 0dB in w= K. 


30-7 . 
DER ER E : 
20 4... | 
[A emmener : 





Module (dB) 








-135 -90 -45 
Phase (degree) 


Figure 1.13. Black diagram 
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Imaginary part 
= = | 
un 2 un 


Y 
© 





Real part 


Figure 1.14. Nyquist diagram 


1.5.2. First order system 
t 


This causal system, with an impulse response of h(t) = Ke TU(t), has a transfer 
function: 


K 1 
H(p) =— Re(p) >-— 
(p) I+ Tp (p) T 


The unit-step response admits as time expression and as Laplace transform the 
following functions: 


t 


i(t)=K|1-e T lun I(p}=— À 


> p(1+7p) 


It has the following characteristics: 
— the final value is equal to K, for an input unit-step function; 
— the tangent at the origin reaches the final value of the response at the end of 


time 7, which is called time constant of the system. 


The response reaches 0.63 K in T and 0.95 K in 3 T. 
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Unit-step 
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0 1 2 3 4 5 
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Figure 1.15. Unit-step response of the first order model 
The frequency response is identified with the complex number H ( j w) : 


| (ja) | = arg(H(j«)) = — Arctg(@T ) 


K 
V1+@°T? 


In the Bode plane we will thus have: 


and —Arctg(oT ) according to logio (a) 


The asymptotic behavior of the gain and phase curves is obtained as follows: 


oT <<] oT >> 1 


K 
2010g10/ 4] = 20 logy K 20logig|H| = 20 TES — 201080 © 


m 
arg H = Ord TRES 
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These values help in building a polygonal approximation of the plot called Bode 
asymptotic plot: 
— gain: two half-straight lines of slope (0) and —20 dB/decade noted by (-1); 


— phase: two asymptotes at Ord and — A rd . 


— 
© 








Module (dB) 


Phase (degree) 





107! 100 10! 


Figure 1.16. Bode diagram of the first order system 


The gain curve is generally approximated by the asymptotic plot. 


The plot of the phase is symmetric with respect to the point (œ = VAN) =- 45°). 
The tangent at the point of symmetry crosses the asymptote 0° at w= 1 4.97 and, by 
symmetry, the asymptote —-90° at w = +57. 


The gaps OG and 6@ between the real curves and the closest asymptotic plots 
are listed in the table of Figures 1.17 and 1.18. 
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Figure 1.17. Black diagram 
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Figure 1.18. Nyquist diagram 
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1.5.3. Second order system 


1 ; 
The second order system, of angular frequency Wg = A and of damping 


coefficient £, is defined by a transfer function such as: 





K 

H(p)= 5 
ig Sa 

Wo Mo 


1.5.3.1. Unit-step response 


The theorems of initial and final values make it possible to easily comprehend 
the asymptotic features of the unit-step response: zero initial value, final value equal 
to K, tangent at the origin with zero slope. 


Based on the value of é with respect to 1, the transfer function poles have a real 
or complex nature and the unit-step response looks different. 


€>1: the transfer function poles are real and the unit-step response has an 
aperiodic look (without oscillation): 


FN PM 


i(t)=K = (1= a)e D +(1+ ae 


é 
é? -1 


where a = 


The tangent at the origin is horizontal. 


If é >> 1, one of the poles prevails over the other and hence: 


Wt 


i(t)= K|1-e 7° 
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Figure 1.19. Unit-step response of the second order system £ = 1 


é =1: critical state. The roots of the denominator of the transfer function are 
real and merged, and the unit-step response is: 


t 
eK) 1-\ie-- |e % 
To 
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Figure 1.20. Unit-step response of the second order system < 1 
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é <1: oscillating state. The two poles of H(p) are conjugated complex numbers 


and the unit-step response is: 


-Sot 
i(t) = K 1- oof conf -8 -a) where tan Tos 
I-¢ 


2 , : 
T, = i is the pseudo-period of the response. 


or 


, ; ; T, 
The instant of the first maximum value is t„ = — 


Ka 
1-é? 





The overflow is written D = 105 =e 


The curves in Figure 1.21 provide the overflow and the terms 
establishment time at 5%) and Wot, according to the damping ¢. 


30 
| Cot: 
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Figure 1.21. Wt, and Wot, according to the damping 


Wot, (t, is the 
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The alternation of slow and fast variations of product Wot, is explained because 
instant f, is defined in reference to the last extremum of the unit-step response that 
exits the band at a final level of +5%. When € increases, the numbers of extrema 
considered can remain constant (slow variation of £,), or it can decrease (fast 
variation of é,). 


100 











Figure 1.22. Overflow according to the damping E 


1.5.3.2. Frequency response 


K 
2 
1-2 oe 
Mo Wo 


£>1: the system is a cascading of two systems of the first order H, and H,. 
The asymptotic plot is built by adding the plots of the two systems separately built 
(see Figure 1.23). 


H(jo) = 
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& <1: the characteristics of the frequency response vary according to the value 
of é. Module and phase are obtained from the following expressions: 








K 
IHGo)| = He) = Are) 
ao 2 
1-5] +42 
w w 
1 : . 
For č < — , the module reaches a maximum A = ——— in an angular 


max 
2 
2éy1-é 
frequency called of resonance ©, = ©) y1- 26 oe, 


We note that the smaller é, the more significant this extremum and the more the 
phase follows its asymptotes to undertake a sudden transition along @, . 


Finally, for é = 0 , the system becomes a pure oscillator with a infinite module in 
@, and a real phase mistaken for the asymptotic phase. 


Figures 1.23, 1.24, 1.25 and 1.26 illustrate the diagrams presenting the aspect of 
the frequency response for a second order system with different values of é. 
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Figure 1.23. Bode diagram of a second order system with 2 1 
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Figure 1.25. Nyquist diagram of a second order system with E< 1 
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Figure 1.26. Black diagram of a second order system with é < 1 


1.6. A few reminders on Fourier and Laplace transforms 


1.6.1. Fourier transform 


Any signal has a reality in time and frequency domains. Our ear is sensitive to 
amplitude (sound level) and frequency of a sound (low or high-pitched tone). These 
time and frequency domains, which are characterized by variables that are opposite 
to one another, are taken in the broad sense: if a magnitude evolves according to a 
distance (atmospheric pressure according to altitude), the concept of frequency will 
be homogenous, contrary to a length. 


The Fourier transform is the mathematical tool that makes it possible to link 
these two domains. It is defined by: 


boo 
X(f)= Í x(the 27H dt = TF(x(t)) 


—oo 


When we seek the value X(f) for a value f, of fthat means that we seek in the 
whole history, past and future, of x(t) which corresponds to frequency f,. This 
corresponds to an infinitely selective filtering. 
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The energy exchanged between x(t) and the harmonic signal of frequency 
to (ei of) can be finite. X(f,) is then finite, or infinite if x(t) is also a harmonic 
signal and X(f) is then characterized by a Dirac impulse 5, ty): 


According to the nature of the signal considered, by using various mathematical 
theories concerning the convergence of indefinite integrals, we can define the 
Fourier transform in the following cases: 


— positively integrable signal: [Eo | dt <œ. The integral definition of the TF 
converges in absolute value. X(f) is a function that tends toward 0 infinitely; 


— integrable square signal or finite energy signal f x(t )? dt <œ. The integral 
definition of the TF exists in the sense of the convergence in root mean square: 


+A : 
limy o Í X(f)- Í xe a} df =0 


-A 


— slightly ascending signal: JA and k rl >A > |x(t) | < I . The Fourier 
transform exists in the distribution sense. We also note the transforms in the sense of 
following traditional distributions: 


TF (O(a) = e °“JY and its reciprocal function TF(e27/4!) = Bay) 


r Fak -er)]- Se 2akr = 5 (r-4] 
FES 


1 
k=- T k=- 


1.6.2. Laplace transform 


When the signal considered has an exponential divergence, irrespectively of the 
mathematical theory considered, we cannot attribute any sense to the integral 
definition of the Fourier transform. 


The idea is to add to the pure imaginary argument 27 j f a real part 0 which is 
chosen in order to converge the integral considered: 


+00 
Í FOAI 


—oo 
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By determining that p = o+ 2njf, we define a function of a complex variable, 
called the Laplace transform of x(t), defined into a vertical band of the complex 
plane, which is determined by the conditions on o ensuring the convergence of the 
integral: 


x(p)= | xee at = L(x) 


— co 


The instability phenomena that can interfere in a linear system are characterized 
by exponential divergence signals; hence, we perceive the interest in the complex 
variable transformations for the analysis and synthesis of linear systems. 


We note that the complex variable that characterizes the Laplace transform is 
noted by s. 


Let us suppose that x(r) is of exponential order, 1.e. locally integrable and as it is 
in two positive real numbers A and B and in two real numbers @ and B so that: 
Vin 20 |x|< Ae” 
VtSt <0 fx |< Be” 
b 


and x(t) locally integrable & Va,b finite [ko |dt <oo, 


a 


The Laplace transform (LT) exists if [X( p)| exists. However: 


t t +00 
X(2)|s| [roc Par f xe Parl +) [ aoea 
Sa h ti 


t 
Í x(t)e”dt| is bounded because x(t) is locally integrable. 


h 


D ty 
Í x(t)e ”dt| < B Í eP- dt which converges if o = Re(p)< £. 


—oo —oo 
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+00 +00 
Í xe dt < A Í ePi dt which converges if o = Re(p)>«. 


t ti 
The LT thus exists for & < ø = Re(p) < f. 


Let c} and o> be the values of & and J ensuring the tightest increases of the 
signal module for ¢ tending toward +œ. We will call the group consisting of 
Junction X(p) and the convergence band lo, o2] Laplace transform (two-sided) 
which is sometimes noted by: 


TLI} = {X(p).161:02 1} 


We note that in the convergence band, the integral that defines the Laplace 
transform is absolutely convergent, hence the properties of holomorphy, continuity 
and derivability with respect to p, of e P are carried over X( p). 


; ; x(t)=e” 120 
EXAMPLE 1.3. — consider the signal defined by 


Ase” 1<0 


Determining the transform of x(t) supposes the following evaluations: 

















too (a-p)t T 
[ Par = £ -l if a< Re(p) 
0 DEN RE MN 
à -0 
(b-p)t 1 
| Par = Le =- if Re(p)< b 
p-b 3 - re 


Provided a is strictly less than b, so that there is a complex plane domain where 
the two integrals considered are convergent, x(t) will admit for LT the function: 
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Saying that a <b means in the time domain: 

— for a>0, the causal part (t 2 0) exponentially diverges which implies, 
b > 0, that the anti-causal part (£ < 0) converges faster toward 0 than the causal 
part diverges; 

— for b <0, the anti-causal part (t < 0) exponentially diverges which implies, 
a <0, that the causal part (> 0) converges faster toward 0 than the anti-causal 
part diverges. 


All the anti-causal signals, zero for t > 0 , that have a Laplace transform are such 
that this transform is defined into a left half-plane (containing Re(p) =—c-). 


All the causal signals, zero for ¢ < 0, that have a Laplace transform are such that 
this transform is defined into a right half-plane, (containing Re(p)=+). The 
transform of such signals is again called one-sided transform. 


For a positively integrable (causal or anti-causal) signal: 


+00 
X(p) for Re(p) = 0 is increased in module by Í Lx) ldt. 


—0c° 


Its Laplace transform always exists, the associated convergence band containing 
the imaginary axis. Hence, we notice the identity between the Laplace transform and 
the Fourier transform because on the imaginary axis: 


+00 
2 Í x(the 27H dt = TF(x) 
p=2n jf 


—0c° 


X(p) 





Finally, we note that the concept of Laplace transform can be generalized in the 
case where the signals considered are modeled by distributions. We recall from what 
was previously discussed in this chapter that the popular Dirac impulse admits the 
constant function equal to 1 as Laplace transform: 


TL(8(t)) = Í Sie Par =[eP]| =1 


—oo 


38 Analysis and Control of Linear Systems 


t=a 


TLE (q(t) = [e-e Patrze] =e” 


1.6.3. Properties 


As we have already seen, the Fourier and Laplace transforms reveal the same 
concept adapted to the type of signal considered. Thus, these transforms have similar 
properties and we will sum up the main ones in the following table. 


We recall that U (t) designates the unit-step function. 


Fourier transform Laplace transform 


Linearity 

TF (Ax + wy) = ATF(x)+ uTF(y) TL(Ax + uy) = ATL(x)+ uTL(y) 

The convergence domain is the intersection of 
each domain of basic transforms. 


o] X(p) 


a < Re(p)< a 


x) x(f) 


Delay 
xlt- r) —E x(f)e f 2 | X(p)e? 


0, < Rep) <a 
Time reverse 
LD Sayer) TEE X(- p) 


— O2 < Re(p)< —0] 


Signal derivation 
The signal is modeled by a continuous function: 


dx TL pX(p) 
dx TF ; —_—> 
gy lN) dt a < Re(p) < 072 
This property verifies that the signal is 


| ARE The signal is modeled by a function that has a 
modeled by a function or a distribution. 


discontinuity of the first kind in #, : 
(x(t, +0)- x(t, - 0))= So 


dx TL y pX(p)- So e™P 
dt o < Re(p)< © 
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Case of a causal signal: 

x(t) = U(t) x(t) 
S, =x(0*)-0 
d = + 
je pX(p)— x(0") 
dt 0, < Re(p)< 07 

Transform of a convolution 

Through a simple calculation of double integral, it is easily shown that: 


TF(x* y)= X(f) Y(f) TL(x* y)= X(p) Y(p) 


defined in the intersection of the convergence 
domains of X(p) and Y(p) 


The Laplace transform also makes it possible to determine the behavior at the 
time limits of a causal signal with the help of the following two theorems. 


THEOREM OF THE INITIAL VALUE.- provided the limits exist, we have: 


lim x()= lim pX(p) 
10" Re(p)—+ce 


THEOREM OF THE FINAL VALUE.- provided the limits exist, we have: 


lim x(t)= lim pX(p) 
t—+oo Re(p)—0 


The convergence domain of X(p) is the right half-plane, bounded on the left by 


the real part of the pole which is at the most right (convergence abscissa do ) 
because signals are causal. 
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1.6.4. Laplace transforms of ordinary causal signals 





Op Convergence 









































xO Xp) abscissa 
So 1 —s 
50) p” me 
U(t)=1 t20 1 0 
U(t)=0 1<0 p 
1 
Ue] T -a 
n! 
U(t” n+1 0 
n! 
Ue] -a 
Dt e (p + a)" +1 
w 
U(t)sin(œt) 7 7 0 
p +@ 
U(t) cos(wt) 3 4 7 0 
p +@ 
: w 
ee = 
U(tye ™ sin(œf) ie = a)? ean 
pta 
U(t)e ™ cos(or) -a 





(p + a)? +0 
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1.6.5. Ordinary Fourier transforms 






































xO TFD) 
1 Sco) 
P 1 |" 6m) 
(a) 8 
eih S) 
sin(27 fot) 3 oh = 3 oh) 
cos( 27 fot) Bs Mee 
0 2 (fy) 2 (-fo) 
1 1 
sgn(t) =P 
mw f 
1 Puit 
CT Ppp 
U(r) MORE: 
(0) 1 
ôn) e 2%ft 
ee 1 tS 5 
Sor) T yaa (%) 
x(t)=1 te [0,7] Ty eh sin(yTo ) 
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Chapter 2 


State Space Representation 


Control techniques based on spectral representation demonstrated their 
performances though numerous industrial implementations, but they also revealed 
their limitations for certain applications. 


The objective of this chapter is to provide the basis for a more general 
representation than the one adopted for the frequency approach and to offer the 
necessary elements to comprehend time control through the state approach. 


Time control, which is largely used in space and aeronautic applications but also 
in industrial applications such as servomechanisms, is based on the representation of 
systems through state variables. According to the matrix structure adopted, this 
modeling is also currently used during the synthesis of control laws, irrespective of 
the method chosen. 


This internal representation, which is richer and more global than the input- 
output representation, is enabling the representation, in the form of a matrix, of any 
system: invariant or non-invariant, linear or non-linear, mono-variable or multi- 
variable, continuous or discrete. This will be presented in this chapter, along with a 
few fundamental properties such as stability, controllability and observability. 


Chapter written by Patrick BOUCHER and Patrick TURELLE. 
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2.1. Reminders on the systems 


A physical system receives external excitations, stores information and energy, 
and delivers them according to the traditional diagram represented in Figure 2.1. 





Input Dynamic Output 
or u(t) Sats y(t) or 
controls y measurement 


Figure 2.1. Dynamic system 


The controls and outputs are generally multiple and their evolution is as a 
function of time. In the determinist case which we are dealing with here, knowing 
the controls u(t) from —c makes it possible to know the outputs y(t) at instant t. 


2.1.1. Internal representation of determinist systems: the concept of state 


A system can be represented externally through the relations that link the inputs 
to the outputs, which are described in the vector form: 


yG)= h{u(r)} Te [0,[ 


Hence, linear and invariant systems (LIS) are traditionally represented, in the 
mono-variable case, by the convolution equation: 


y(t) = A(t) *u() = f h(t)u(t-Ddt = f h(t—Tu(r) dr 


This representation is linked to the concept of transfer function by Laplacian 
transformation of the convolution equation: 


Y(p)= H(p)U(p) 


However, we are soon limited at the level of these representations by the non- 
linearities and non-stationarity of systems. Hence, it is interesting to consider what 
we call the state of a system in order to obtain a so-called “internal” representation. 
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The state is a vector quantity x(¢) defined at any instant which represents the 
set of information and energies stored by the system at instant £. 


EXAMPLE 2.1.— let us consider the system consisting of a body in freefall in the 
field of gravity (supposed to be constant). The principle of energy conservation 
leads to the relation: 


1 2 
Eiotal = E potential + Exinetic = Mgz + aad = constant 


Knowing v (speed of the body) and z (altitude of the body) is sufficient in order 


oi 
v(t) 


to characterize the evolution of this mechanical system. Hence, vector x(t) = | 


makes it possible to describe this evolution. 


Thus, knowing the state of a system at instant ¢'<¢ and the controls u(t) 
applied to the system between instants ¢' and f , the system output is written as: 


y(t) =h,,-[x(t'), u(z)] for r'<T<t 
Similarly, the evolution of the state will be expressed by the relation: 
x(t) = Ø p [x(t'),u(Z)] for r'<T<t 


We note that x(t’) can be expressed from x(t") ({"<f') and the controls u(z') 
applied to the system between instants ¢" and ¢': 


X(t’) = Pp p[x(t"), (TI for 1"<T'<1i' 
which leads to: 
X(t) = Q [Oy p(X"), W(T')],u(Z)] for 1"<T'<t'<T<t 
Between" and f, we have: 


X(t) = Pa" [x(¢"), u(z")] for t"<T"<t 
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The comparison between these two results leads to the property of transition, 
which is essential for the systems that we are analyzing here: 


Pi, X"), WT") = A, 165, Txt"), (TI, u()] 
t"<7"<t M<s'<t<t 


and which characterizes the transition of a state x(t") to a state x(t) by going 
through x(f'). 


The state of a system sums up its past entirely. 


2.1.2. Equations of state and equations of measurement for continuous systems 


Knowing state x(t) at instant ¢ and the controls applied for t < 7<t+At, we 
have the relations: 


X(t+ At) = Paart XO, UCT)] 


x(tt+At)—x(t)  P+azr [x(t),u(7)]— 9, ;[x(t), u] 
At At 





The equation of state is then obtained by going at the bound (At — 0) in this 
relation. Hence, we obtain the matrix differential equation: 


= = fix), u(t), t] 


in which the control vector u(t) has m components, and the state vector x(t) is 
characterized by n components called state variables. 


The vector of measurements or of observations has / components and the 
equation of measurement has the following form: 


y(t) = h[ x(t), u), t] 
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The evolution of a dynamic system subjected to inputs u(t) and delivering 
information y(t) is entirely characterized by a vector x(t) of size n linked to the 
inputs and outputs by the relations: 


x(t) = fIx(t), ur), t] 


y(t) = hix), u), t] 


x, (t) u (t) yy) 
see 20 iz 20 ME 20) 
x, (t) Un (t) y(t) 


2.1.3. Case of linear systems 


When the system is linear (principle of superposition of causes and effects), the 
equations of state and measurement have the following form: 


x(t) = A(t) x(t) + B(4) u(t) 


y(t) = CH) x (0) + D@) uc) 


in which: 
— A(t) is the evolution matrix (dim nx n) ; 
— B(t) is the control matrix (dim nxm); 
— C(t) is the observation matrix (dim [x n); 
— D(ż) is the direct transmission matrix (dim xm). 


We note that state representation is not unique and in any case we would have to 
talk of a state representation (see section 2.3.2). 
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2.1.4. Case of continuous and invariant linear systems 


The four state representation matrices are constant, irrespective of time: 
x(t) = A x(t)+B u(t) 


y(t)=C x(t) + Du(t) 


2.2. Resolving the equation of state 


Firstly, let us recall the nature of the magnitudes used: 

— state vector: x(t) column vector nx1; 

— control vector: u(t) column vector m x 1; 

— observation vector: y(t) column vector / x 1; 

— equation of state: x(t) = A(t) x(t) + B(t)u(t); [2.1] 
— equation of measurement: y(t) = C(t) x(t) + D(t) u(t). [2.2] 


We will approach the resolution of the equation of state in two instances: 
— free state u(t) =0; 
— forced state u(t) #0. 


2.2.1. Free state 


This refers to solving the equation x(t) = A(t) x(t) from the initial condition 
X(¢9) =Xq- Since the equation is linear, the solution x(t) is expressed linearly 
according to x(t) ) as follows: 


x(t)= (t,o) x(to) 


The matrix ®(t, tọ) of size nxn is called a system transition matrix. It has the 
following properties: 


D, to) =D(t,t,) Dh, to); (transition property previously mentioned) 


D 1) =1; (x =t, t) x) 
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—1 
(t, t) = (h, ty) 


(x(t) = ®(y, 4) x(t) > x(t) = ®(G, 2)x(t)) 


Pir to) = AW) ®(t, 19) Vig; (KG) = AG) xt) and X(t) = (1, to) x(to)) 


2.2.2. Forced state 

This refers to solving the equation x(t)= A(t) x(t)+B() u(t) for u(t) 40, 
knowing the general solution of the homogenous equation (x(t) =O(t,t9)x(tg)). 
Then, a particular solution of the complete equation is searched for as 
X p(t) =O(¢,to z(t), where function z(t) is the unknown factor obtained from the 
initial condition x pito)=9 (2(to) = 0). 


By deriving x ,(¢) and by transferring in [2.1] we have: 

. d® : 

Xp (D = Frs (t, to) ZA +Ò (t, to) Z(t) = ADD (t, to) Z(t) + BO) u(t) 
However: 


ot ty) =AMV(E, to) > P(t, to) iA) =B) ult) © 
t 
40 = [t,t] BO w(t) 

or by considering the initial condition on z: 

t t 

z(t) = foc. to) B(nu(r)dr => x, (1)= D(t0) Joc. to)! B(T) u(r) dr 

to to 

and by using the properties of the transition matrix mentioned above: 


x, (0 = Í D(t, T)B(T)dT 


to 
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The general solution of the complete equation is then expressed by: 


t 
x(t) =®(t, to) x(t) + f DBT u(t) dr 


lo 


and the output y(t) is then simply calculated from x(f) : 


y(t) = CD, to) x(to) + f D, 1) B(nu(r) dr |+ D() uA) 


ty 


The only difficulty of this solving method is the calculation of the transition 
matrix. This method often calls upon numeric solving techniques. 


2.2.3. Particular case of linear and invariant systems 


When matrices A, B, C and D are independent of time, the transition matrix 
@(t,toọ) takes a simple particular form because D depends only on the difference 


t—-t0. Indeed, equation D (6.19) = AE, to), with A constant, is resolved by 


analogy with the scalar case by searching for a solution as the sum of an entire 
matrix sequence. 


Ot, to) = À) +A (t-t)+A2(t- t)? fused Ny et vee 
d(£,10) =0  +A,+2A;,(1-10)+:-+k Arlt- LE 


= A[Ao +A; (t-to PAC) A en 455] 


By identifying term to term these two developments, we successively obtain: 


Ay =A Ao, Ay => A Ay, TER Ay => A Ag» EER 
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By considering the fact that ® (tọ, #0) = A0 =I, for ®(¢,t9) we obtain the 
following development: 


® (t, WIHA- 19) AP ++ AG) + 


which is not that of the exponential function in the scalar case. Hence, we 
symbolically mark this development: 


D(t, tg) = et) 


and therefore: 


t 
x(t) = eA) x(t) + f eA B(7) u(r) dr 


lo 


In this particular case of linear and invariant systems, the transition matrix can be 
calculated analytically from, among others, the values of matrix A. We will see 
some of these methods in the following section. 


2.2.4. Calculation method of the transition matrix eAt-to) 


Use of serial development 


When the power calculation of matrix A is done simply (in particular through 
simple recurrences), we can use the definition of the matrix exponential function as 
an entire sequence: 


eM TAAU- HEA? C-t)? tt AE) + 
EXAMPLE 2.2. 
_fo1 2_[0 0 
a=[5 0] = 4?-[o à] 


=> e^) =I+A(t-to) al =) 
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_{0 1 2_|0 -a 
a=|) cl >A | Ki 


> al? a bs ak -0 CT 
ET à 0 (-a)* 


{= eo) 


e^ (E-to) = 14 AG -t0)+ 5 A1) bet AN (tty)! ae = 


0 e to) 


Sylvester formula (distinct eigenvalues of A) 
If 4,,4.---,A, are the n distinct eigenvalues of A (solutions of the equation: 


det(A — AI) = 0), the transition matrix eA) js expressed by: 








n | A-2: 
A(t-to) _ A; (t-to) J 
e = > e | | a 
r -A| 4-1 
i=l j=l 
j#i 
0 1 


0 -a 


eigenvalues of a triangular matrix are presented in its diagonal). 


EXAMPLE 23- A =| | A, =0, A =-a distinct eigenvalues (the 


A-Ajl 
A-A 


At-to)| AAI |, 46-10) AW Al 
A-A n-A 





n Jilt—ty) n 
ET) 
a 


j#i 





2 AAEE |y ao fA] 


O+a -a-0 
1 ee _ 7a t=t) 
Tg es Eg 
a a a 
_ ,Ut-to) _ altto) 
0 lzen t 1 0 1 lze > °” 
= a + = a 
0 1 


0 e to) L] 0 e 4-0) 
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Sylvester interpolation method 


When the eigenvalues of A are not distinct, we cannot use the previous formula. 
Hence, we will do as follows: we will suppose that A, is a simple eigenvalue and 
that A, is a double eigenvalue. Thus, we build a matrix in which the rows of simple 
eigenvalues are of the same type: 





1A a. art gee 
LA 4 a efli) 
01 2% EDA (t-te 
1 As AB - AST eu) 
i À ADS AE] AU) | 





The first row corresponding to a multiple eigenvalue is done in the same way, 
but the following row is the derivative of the current row with respect to the multiple 
eigenvalue (here A, ). If the eigenvalue is double, the next row has a standard form 
with the next eigenvalue. If the eigenvalue is triple, we repeat the procedure and we 
derivate again the second row corresponding to this triple eigenvalue and so on. 
Hence we obtain a matrix of n rows andn + 1 columns which we complete by a last 
row built from the successive powers of matrix A and completed by the transition 
matrix e46) | 


Hence, Sylvester’s method consists of formally calculating the determinant of 
this matrix and of extracting the transition matrix from this calculation by writing 
that this determinant, formally developed with respect to its last row, is zero. 


EXAMPLE 2.4.- if we take the same example: A = k | and build the matrix 


which in our example is limited to: 


1 À e^ F 
1 sA e% (to) _ = —a(t-t,) 
LA A6) I A eA) 


à 
D 
= 


{à 
Q 
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The determinant of this matrix developed in relation with the last row is: 


























0 1 1 1 A(t=t, 1 0 
A=IXx ia eh) — AX 1 e7 4-0) +e (=) x ieg 
A =al- (e0 -A — ae?) 

—a(t-t, ) 
7 _ 7 a(t=ty) 1=e : 
AS ge age elt a 
a 0 et) 


NOTE 2.1.- regarding these last two techniques, we note that the transition matrix 
elt) js expressed by a finite degree polynomial (n — 1) of matrix A. This result 
is due to the fact that any matrix verifies its characteristic equation, det(A — AI) 
(theorem of Cayley Hamilton) and thus all powers of A of a degree more than n are 
expressed by a linear combination of first powers n — 1. 


Method of modes 


When matrix A is diagonalizable (it is at least the case when the eigenvalues of 
A are distinct), we can calculate the transition matrix by using the diagonal form of 
A. 


A =MAM ! > A? = MAMMAM”! = MA2M7!...A* = MA‘m7! 


e AU) =1+A(-10)+ 3 A? (6-10)? ++ Amt) += 


2 2 k k -1 A(t- -1 
M} LAG— 10) + LA (t-to) RAA (t-to) +M = MAM 


However, matrix e^(t-to) , by build, is the diagonal matrix whose diagonal 
elements are the scalar exponential functions e^it). The calculation of the 
transition matrix is then done by determining a system of eigenvectors of 


A({x1; X2," X, }) in order to define the basic change matrix M. 
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EXAMPLE 2.5. 


=a=[) A eigenvalues 1;=0, À,=-a 


= 1 0 
AU A0 Le 


Determining M: 





Ax, =4,x,=0>x,= p (for example) 





AX) =A X70 =a X) >X = E À (for example) 


1 1 
11 -1 a 
M = [x x]= >M = 
0 -a 1 
O —— 
a 
ie 
= eho) = 1 1 1 0 a 
0 —a 0 e ttt) 0 1 


1-e alto) 
e AU) = 1 — 7 
0 et) 


2.2.5. Application to the modeling of linear discrete systems 


Let us consider the equation providing the evolution of a state of a system 


between two instants to and t: 


t 
x(t) =©(t,t9) x(to) + f (t,7) B(T) u(r)dr 


to 
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This equation is valid for any pair of instants fp <t. 
Hence, let us suppose that between two instants #, and f,,, the control u(f) 


applied to the system is constant and equal to u; = u(t). The evolution of the state 
between these two instants can be expressed as follows: 


Tel 
X(tk41) = Oty ay, ty) X(t) + Í D 7) BOdT july, ) 
tk 
teat 
where: O(t; 4 tk) = F(k) and [or T) B(z) dt = G(k) are two functions of k. 
tk 
When instants ¢, are multiples of a sampling period T(t, =kT) and when 


between the two sampling instants the control applied to the system is constant, the 
evolution of this state is: 


x(k+1)=F(k)x(k)+G(k)u(k) 


which represents the discrete model of the continuous system of the equation of 
state: 


x(t) = A(t) x(t) + B(t) u(t) operated by a control: 
u(t) =u(AT) for kT <t<(k+1T. 
Moreover, when the system is invariant, x(t) = Ax(t)+ Bu(r), then we obtain: 
F(k) =e"! =F 
teat thet T 


and: G(k)= f Du, D B dr= f AU -DB dr= f AOB d0=G 


tk ty 0 
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These two constant matrices are independent of k and the discrete model 
becomes: 


x(k +1) = Fx(k) + Gu(k) 


In both cases, the equation of measurement is added to the equation of state in 
order to complete the model. 
2.3. Scalar representation of linear and invariant systems 
2.3.1. State passage — transfer 


Linear and invariant systems are characterized by an external representation that 
has the form of a transfer matrix linking the controls to the outputs: 


Y(p)=H(p) U(p) 


For mono-variable systems (y and u scalar), we define the transfer function of 


f Y 
the system by the ratio: H(p) = FC), 


U(p) 
In the case of multi-variable systems, the transfer is characterized by matrix 
H(p) whose size is linked to the size of output vectors y and control vectors u. If y 


has a size / and u has a size m, transfer matrix H(p) has the size / x m. The problem 
that arises is the passage from an internal representation: 


x(t) = Ax(t)+Bu(t) 
y(t) =Cx(t)+Du(t) 
to the external representation H(p) . 


This passage will be done, in the case of the linear and invariant systems we are 
dealing with, by using the Laplace transform on the equations of state and 
measurement: 


PX(p)-x(t9) =AX(p)+BU(p) => [pI-A] X(p) = x(t)+BU(p) 
=> X(p)=[pl-AT !x(0)+[pl-AT BU(p) 


58 Analysis and Control of Linear Systems 


From the analogy between this transform and the expression of time response: 


t 
x(t) =e“) x(t,) + f eA Bu(r) dr 
lo 
we obtain the following relations: 


[p IA" = Left} 


t 
[p I-AT'B U(p)= Lie) * Bui} = L Í eA“ Bur) dr 


to 


which provide a new calculation technique of the transition matrix: 
eA) — pol llp1-aT'} 


If the initial state is zero (hypothesis used in order to define the transfer function 
of a system), we have: 


X(p)=[pI-A]' BU(p) and Y(p)=[C[p I-AT' B+D] U(p), 
hence we obtain the transfer matrix: 
H(p)=[C[p I-A]! B+D] 


Element H; (p) of matrix H(p) represents the transfer between the control 
u(t) and an output y, (t), when the other controls are at zero. Thus, we can obtain 
from the matrix: 


KG] [+ 
¥(p)=H(p)xU(p) => PP | =f HG LIU: 


Cou eek se, 2 A U„(p) 
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Y;(p) 
U; (p) 





H;(p)= to U,(p)=0 fork 4 j 


NOTE 2.2.— there are systematic and recursive methods of calculating matrix 
[pI - A]! such as the Leverrier-Souriau algorithm; firstly, matrix [pI- A]! is 
written as: 


—1 —2 
[pI - A] =li Bop” +B, p” ++ B, 
dy p' td, pr ltd, 


where: 


dy p" +d, De +--+d, = det[ pI-A] 


which is an expression where the idea is to determine the n square matrices B, 
nxn and the n+1 scalar coefficients d;. 


Hence, we make the following iterative calculations: 


do =1 
By =I dı = —trace{ByA} 
B; = ByA+d I dy = -2 trace{B, A} 
By = B,A+ dI dy = = trace {B.A} 


B; = B; A+ dx dys} = -— trace{ B; A} 
+ 


B„-ı =B,_2A+d,_I d, = Bs trace {B,_; A} 
n 


and the last relation that represents the verification of the calculation and must give: 
B, =B,_, Atd, 1=0 


which ends the calculation of terms defining [pI — A] me 


60 Analysis and Control of Linear Systems 


2.3.2. Change of basis in the state space 

As we have already mentioned, the representation of a state of a system is not 
unique. Only the controls applied to the system and the resulting outputs are the 
physical magnitudes of the systems. Hence, there is an infinite number of internal 


representations of a system which depend on the state vector chosen. 


We will verify it by performing a change of basis in the state space. Let us 
assume that x(t) =M x(t) with M constant and X(t) state of a system in the new 
basis: 


x(t) = M X(x) = Ax(t)+Bu(t) = AM X(1)+Bu(t)> x = M 1AM X()+M! Bu(s) 
X() = AX()+Bu(s) 
y(t) =CX(t)=Ax(t)+Bu(t) =AMX()+Du() yA = C X(N + Dus) 
In the new basis, X(t) = M~! x(t) and: 
X(t) =A X(t) + Bul) 
y(t) =CX(t) + Du(s) 
with: Â=M AM B=M /B C=CM and D=D. 
Hence, let us calculate: 


CMIpi-M AM] (M 'B+D 
= C[Mjpi-M 'AM] M!T!B+D 
C[pI- AJ !B+D = H(p) 


œL 
Il 


A(p)=Clp I-Ay! B+ 


The change of basis did not modify the transfer matrix of the system. 


2.3.3. Transfer passage — state 


We will deal here only with mono-variable systems. Generalizing of multi- 
variable systems is done easily for each scalar transfer function H;;(p). 
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Hence, let us consider a transfer function: 


Y 3 + + 24.4 n-l N 
ioe aan) a ee eee) 


U(p) by tb, ptby p> ++p" ~ D(p) 


We note that the coefficient of the highest degree term of the denominator is 
standardized at b, =1. We also note that the degree of the numerator is at least one 


unit lower than the one of the denominator. If it is in any other way (same degree 


N@) 
D(p) 
decomposed into a full part dọ and a rational fraction of the type shown above. This 





maximum for a physically conceivable system), the rational fraction 


full part d, characterizes the direct transmission of the control and hence represents 
an element of matrix D of the state model. 


Among all the forms of transfer passage — state, we will describe two 
particular forms adapted to problems of control and estimation. 


Companion form for the control 


We will start from the expression: Y(p)= MPs (p). 


D(p) 
Let us assume that: 
Y(p) = N(p)X1(p) with X, (p) -5E 


where X,(p) represents the Laplace transform of x,(¢), the first of n state 
variables constituting x(f) . It is thus possible to write the two following polynomial 
relations: 


D(p) X,(p)= (by + pt bap? ++p") Xi(p)=U(p) 
Y(p)=N(p) Xi(p) = (ag taptap +--+ anap Xp) 
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In the time domain, the previous equations lead to: 


d"x, 


dt” 





(Nb, ELO +b aO =u) 
n-1l 


dx dx 
JO = 4,4 ee | tao x, (0) 


Thus, by choosing as state variables x,(¢) and its (n—1) first derivatives, the 
equations of state and measurement are written: 


xı) =x.) 


X(t) = x3 (£) 


X,1(t) =x, (6) 
X(t) = -bo x(t) — By x) yy x, (£) +u() 


V(t) = dg X(t) +a, x (C++ 44,1 x, (0) 


which lead to the matrix form called companion form for the control whose block 
diagram is given in Figure 2.2. 


a ae 
+1 * ry ae 




















Figure 2.2. Companion form for the control 
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This representation, whose state variables corresponding to this form are 
currently called phase variables (phase plane for n = 2), is summed up by the 
following equation of state and equation of measurement: 


01 0 ..0 A 

0 0 1 0:0 ! 
x0)=]; A x(+)- uO) 

; R. Se . 01 0 

= by =b, =b © -b, ; 


yt)=[a9 q ay] x(t) 


Companion form for the observation 


Let us bring back the fraction form: 


Y(p) _ao+tap+asp +44, ip" _N(p) 
U(p) by +bp+ba p ++p” D(p) 





By dividing N(p) and D(p) by p” , we obtain: 











1 1 
do y +4; PE +a, TA +e tan — 
Y(p)_ p De pe 
U(p) bo L +b, l +b, E Leur) 
P P P 


which leads to: 


1 


n-1 





+a) |U(p) 





Y(p)= a, ybte tay 
P 





n-l 


-byy ntb vy ro 
P p” 


| a2 U(p)-b -2 Y(p) 
Yis | Ay, UO Loe 
(p) à dn-1 (p) 1 t He U(p)- bo ror 
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Hence, through X, (p) = L{x, (9}= Y(p), we obtain: 


1 
Xn (p) = lan- U(p)- bn- Xn(P)+ Xn- (P) 
1 
Xn-1 (p) = p [an-2 U(p) — Dy-2 Xn (p) + Xn-2 (p)] 
X\(p) = [ay U(p)-b X, (PA 


These n equations are directly transcribed into the time domain as follows: 


xı (t) = ay u(t) — bo x, (t) 

x, (= a, u(t)—b, x, +x (0) 

X3(¢) = a, u(t)-b x, (1) + X(t) 

Xn (t)= dy u(t)-b, Xn ()+x, (t) 


Grouped as a matrix, they represent the companion form for the observation: 


00: < -by 7 
100. -=b i 
x()=]91 9° b ele uO 
01 0-b, : 
01-b, A 

Hena 








the equation of measurement being reduced to: 


yt=(0 0 0- 0 xH 


The block diagram of this representation is given in Figure 2.3. 
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t 1 4 t Le 1 | Xn-1 1 | Xn y 
à P k p P p 


sl lent lo 
7 |} ei)? 

















Figure 2.3. Companion form for the observation 


2.3.4. Scalar representation of invariant and linear discrete systems 


In section 2.2.5 we have seen the modeling of discrete linear and invariant 
systems as: 


x(k +1) =F x(k) +Gu(k) 
y(k) =Hx(#)+Ju(k) 


The scalar representation of such systems is obtained by using the transformation 
in z on the equation of state (zero initial conditions): 


z X(z)=FX(z)+GU(z) => [dl-F] X(z) = GU(z) 
> XG)=[1-F] ! GUD 
Y(2)=HX(2)+GU(G) > Y()=[HzA-F] ! G+J] U) = T2) U2) 


This last relation makes it possible to define the transfer matrix of the discrete 
system: 


T(z) = [Hiz - F] !G +J] 


The calculation of matrix [z I-F] ! can be done by using the Leverrier- 
Souriau algorithm which has been described for the calculation of matrix 
[p I-A]7! in section 2.3.1. 
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2.4, Controllability of systems 
2.4.1. General definitions 

A system is controllable between xo and x, if we can find a control u(¢) that 
makes it possible to pass from the initial state x9 to the final state x, within a finite 
period of time. 


A system is entirely controllable if there is a solution u(¢) for any pair {xq,x;}. 


A system is partially controllable if we can operate only on certain components 
of the state of a system between xs and xı. 


EXAMPLE 2.6— let us consider the size 4 mono-variable system defined by the 
equation of state: 


a,0 0 0 bi 

n 0 a0 0 b, 

x(t) = 0 0 a0 x(t) + 0 u(t) 
0 0 0 a, 0 


It is clear that only variables 2,(¢) andx,(f) are controllable by u(t). The 
evolution of variables x3(¢) and x4(t) remains independent of the control. 


2.4.2. Controllability of linear and invariant systems 


Firstly, we will limit ourselves to the case of a mono-variable system whose 
control u(t)is reduced to a scalar term. Hence, we have to define the necessary 
conditions (i.e. the sufficient conditions) so that the system described by the 
equation of state x(t) = A x(t)+ B u(t) is controllable. The linearity of the equation 
makes it possible to calculate for a zero initial state (x(t) = 0). 


If we consider x, € R” , there is a control u(t) so that, for finite t, , we have: 
1 1 


ti ti 
x(t) =e8G xy) + Í MUR u(r) dr= Í eAG-OB u(t) (six(to)=0) 


to to 
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t 
For any instant t, we have x(t) = [ers u(t)dT. 


to 


However, according to the Cayley-Hamilton theorem, e““~” 


matrix polynomial of degree n —1: 


can develop as a 


nal 
END = apr, t) I+, NA+ + OlT, pa"! =) a(r, DA! 
i=0 


because, as we have seen during the calculation of the matrix exponential function, 
A” and all powers more than A are linear combinations of (n — 1) first powers. 


Thus: 
t|n-1 i n-1 | t 
x(t) = f Xal, DA! |Bu(r) ar= Y'A'B] (nt) u(t) dr 
ip Li=0 0 ig 


t 
By supposing that o; = [a (t, tu(r) dT , the previous equation becomes: 


to 
x(t) =B p +AB p; +--+ A" |B, ; 


an expression in which matrices AB are column vectors (like B — mono-variable 
control) and functions ¢,(t) are scalar. 


By using a vector notation: (ft) = ; , we can sum up the previous 
Pn- (t) 
equation as follows: x(t) = [B|AB| JA” Blow) = Q(t), where matrix QG is the 


square matrix nxn built by juxtaposing n column vectors B, AB,---, A””'B. 


This result shows that irrespective of control u(¢), x(f) remains within the 
vector sub-space generated by vectors A'B , for i=0,1,---,n-1. 


68 Analysis and Control of Linear Systems 


To reach any point x(t) of the state space, the n A'B columns must be linearly 
independent. In other words, it is necessary that: 


rank[Qc | =n 


If this condition is satisfied, we have (t) = Qc x(t). 
The solution in p(t) of this problem is unique but there is an infinite number of 
u(t) solutions satisfying: 
t 
p(t) = [a;(r,9 ua) dr 
to 
The control u(t) is defined only by its n “projections” on the functions &;(7,f). 


To conclude, a mono-variable linear and invariant system characterized by its 
equation of state: 


x(t) = Ax(t)+ Bu(t) 
is entirely controllable by control u(t) if and only if the controllability matrix: 
Qc =[B | AB |... | A” B] 


is of rank n (and thus reversible or regular). 
This result can be generalized to the multi-variable case with the following 
adjustments: if dim B = nxm, p;(t) is no longer a scalar function but a vector of 


size m (that of u(t)). Consequently, p(t) is a vector of size nxm. Considering 
these adjustments, the expression: 


x()=[B | AB |---| A” "B] =Q Ga) 


remains valid, but matrix QG is of size nx(nxm). 
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The system will be entirely controllable if matrix QG is of rank n, which means 
that we can extract linearly independent columns from: 


Qc =IB | AB |- | A” "B] 


2.4.3. Canonic representation of partially controllable systems 


When rank [Qg]=q<n, the relation x(t) =B po + AB p; +--+A"7B0, 
limits the state evolution x(t) of the system to a vectorial sub-space of size q 
generated by the independent columns of matrix Qg . Hence, we will seek a basis of 
the state space where the equation of state makes it possible to split the state vector 
into an entirely controllable part of size g and a non-controllable part of size n — q. 
Thus, let us choose a basis of the vector sub-space consisting of q linear independent 
combinations of q independent vectors of Qg. 


Let T; of size nxq be the matrix built from these q column vectors. 


Let us build the basis change matrix T (so that x(t) =Tx(¢)) by completing 
block T, by a block T, of size nx(n-q) so that: 


T= [T, | Th]\n is reversible 
>> 


q n-q 


In section 2.3.2 we have seen that in a basis change the new equations of state 
and measurement of the system are: 


X() =T AT X()+T'Bu(t) = A X(t) +Bu(s) 


y(t) =CT X(t) + Du(t) =C X(t) + Dus) 


In order to calculate matrices A,B and Č , we will use the following property: let 
M=[M, |M; |---| M,.]< »row be a matrix extracted from QG by linear 





r columns (r<q) 
combinations of the independent columns of QG. By construction, columns 
M; of M are linked to columns T,; of T) through a relation of the type: 
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q 
M; =$ ky T= kj itka Tat tkg Tig for: i=1,2 r (r<q) 
j=l 


This expression translated as a matrix becomes: 


ki 
kin 
M; =[Ti | To |---| Tig] 








By grouping the M, in order to build matrix M, we obtain: 
M=T,K =[M; | Mp |= |M,] 


To conclude, if M(dimnxr) is built from linear combinations of the columns of 
matrix QG, then there is a matrix K (dimgxr) such that: 


M=T, K 
UXT nxq PT 


We will use this property to calculate matrices A,B and Č . 
À=T !AT-=T !AIT,|T,]=T ![AT,|AT,] 


However, AT; (nxq) is, by construction, extracted from QG > 4K,(qxq) so 
that AT, =T, K,, hence: 


T''AT=T [T] K] | AT,]=[T'T,K,| TAT] 





> 


q n=q q n—q 


but T T=T'[T, |D]=T q |T} BL, 
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Block Tr can thus be written: 


K; 


I “3 
TTT, share =>A-=|-- 


TAT, 
Ln-q 0 


— 


q 


In canonical form: 


Ra Pai! AB Yq 
0 A Ln-q 


q n-q 
Likewise, B=T! B andB is extracted from QG: 


= 3K ,(qxm) so thatB=T, K, 


ds K, || 
> B=T!' T, K,=| 2| 
0 \ln-q 


m 


Matrix C does not have any particular characteristic. 


In the new basis, the equations of state and measurement have the following 
canonical form: 


O= Ah sAn X(t) + pi u(t) 

0 A» 0 
yt)=[C, Cy] &()+D us) 

This form corresponds to a partitioning of the state vector into two sub-vectors 


X,(t), X(t) representing the controllable part and the non-controllable part of state 
X(t) . The equations developed from this partitioning are: 


X(t) = Ay, X (0+ Ap X(t) +B, u(t) controllable part (size q) 
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re (£) = Ay X, (t) non-controllable part (size n — q) 


EXAMPLE 2.7.- let us consider the system described by the equations: 


-1 00 0 1 

X(t) = ne: x(t) + ° u(t) y(t)=[0 0 1 0] x(t) 
-1 0 0 1 1 
0000 0 


Is this system controllable? 


1 -1 +1 -1 
0 +1 -1 +1 : 

Qc = mars rank QG =2 (columns (2), (3) and (4) are linked). 
0 0 0 0 


T=[T, | T2] Ty linear combination of independent columns of Qg. 


T; = 


O = © = 
oo =- © 
- 

Il 
O = © = 
oo = © 
o=- o © 
re © © © 


T, is chosen in order to least disturb the initial state structure and in particular in 
order to maintain, for the new state variables, a significant physical nature. 


1 0!0 0 -1 0!0 0 
| | 
| sO AO: 0 > 1 0!1 0 
T! = i >  À=T"AT| ----- ae 
-1 011 0 0 010 1 
0 010 1 0 010 0 
B=T 'B= and C=C T= [1 0 1 0] 


AE + 
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In the new basis, we thus obtain the canonical form for the following 
controllability: 


-1 010 0 1 
O= e eee X()+ g u(t) 

00:01 0 

0 010 0 0 


y=L1 011 0 JO 
where we note the blocks of zeros characteristic for this form in A and B. 
2.4.4. Scalar representation of partially controllable systems 


From the canonical form described above, we can determine the transfer matrix 
of the system as done in section 2.3.2: 
B 
lip 
0 


By using the triangular nature per blocks of the matrix to reverse, we obtain: 


` ae Mu I -å -A 
H(p)=C[pI-A]'B+D=[C, C5] FE eu 12 





0 Ply_g ~ À» 


. [PL -1 Mio Bı 


=[Č C3] ane’ +D 
[pI nq -An] 0 


which is a form where the calculation of matrix M, is useless because it appears 
only in the final result: 


H(p) = Cl, -A,,]7' B,+D 


We note on this form that the degree in p of the transfer matrix is reduced a 
priori to q instead of n. Hence, the scalar response represents only the controllable 
part of the state. 
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2.5. Observability of systems 
2.5.1. General definitions 


A system is observable if the observation of measurement y(t) during a finite 
period of time (to, t,) makes it possible to determine the state vector at instant fo . 


A system is entirely or partially observable depending on whether we can build 
all or a part of state x(¢g). 
2.5.2. Observability of linear and invariant systems 


Firstly, we will suppose that measurement y(t) performed on the system is a 
scalar measurement and we will deal here with the following problem. 


Let us consider a free state system described by equation of state x(t) = A x(t) 
and for which we notice the scalar output y(t) = C x(t) (dim C =1xn). 


If x, is the initial state x(0) (tọ =0), how can we find state x, from the 
observation of y(t) during the finite period of time [0, 4]? We will answer this 
question by formulating output y(t) according to xq. 


We have: 
n-1 } n-l P 
x(t) = exp = D a; (0)A‘xo and thus: y(t) = e“/xy = D a; (CA'Xo 
i=0 i=0 


By forming the scalar products: 


h 


<a, (A >= [est NO dt =z; 
0 


n-l( 4 
<a, yO >= >) Í a(t) a(t) dt| CA! xo 
i=0\ 0 


we obtain: 
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n-1 


Zk =< a, (t), y(t) >= Dax CA! Xo 




















i=0 
t 
with: ay = Í a, (t) à, (t) dt 
0 
and in a matrix form: 
Zo 499 41 © © * And C 
zi ao] ; m eh os CA 
CA? 
— z XQ 
n-l 
[Zn] [on ` © dy nly CA’ | 
size: n x 1 nxn nxnnxl 


We can show that, by construction, matrix {aj} is regular. Thus, a necessary and 
sufficient condition for the system to be observable (thus to be able to uniquely 
extract x, from the equation below) is that the observability matrix defined by: 





C 
CA 


(and of size nxn here) is of rank n and we obtain: 
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C aoo 41 © © * Aon Zo 
CA aol : Ben smc A zi 
CA? 
Xo = > 
n-l 
[CA J [don ° ay ~ an-ı n=l | LZn-1 | 




















EXAMPLE 2.8.— let us consider a system represented by equations of state and 
measurement in companion form for the observation: 


[ 0 0 -bo ] [ ao ] 
100 -b ay 
010... -b 
x(t) = x(t) + u(t) 
0 1 0 -b, > 
0 1 —b,_ | L an- | 














and y(t)=[0 0 0 -0 1] x(t). 


The calculation of observability matrix: 








IC | Fo 0 - - 0 1] 

CA 0 : : O 1 * 

CA? s Se Me 
Qo = leads to: Qo = | 

| 0 1 * 

ca"! [1 A ag e PE OR | 








matrix of rank n by construction. The companion form for the observation represents 
entirely observable systems through the inputs. Like in the case of controllability, 
this result can be generalized to the multi-variable case for which observability 
matrix Qọ is of size ni xn. 
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The observability test becomes: 


C 
CA 


2 
CA 
rank|. =n 








CA”! 


nlxn 


2.5.3. Case of partially observable systems 


When matrix Q, is not of full rank, the system is not entirely controllable and 
only a part of the state vector is reconstructible from the observations. If rank 
Qo =r <n, we can show (as we did for the canonical form for controllability) that 
there is a basis of the state space where the system admits the following canonical 
representation: 


ï= A ae ie 
| Es Pag ro 


y(t) = [č 0 o+put 


However, the basis change matrix T (x(t)=T X(r)) is obtained differently; we 
build T~' (and not T) into two blocks: 


gi qT’; 
TD 


The first block (T !), is obtained by linear combinations of independent rows of 
matrix Qg and completed by block (T) so that the ensemble is a regular matrix. 
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Once this matrix T™! is calculated, we determine the matrices in the new basis 
like we did above, for the canonical form for controllability: 


A=T AT B=T"'B C=CT 


The canonical form thus obtained entails a split of the state vector into two 
parties: 


— X, (£) of size r which represents the entirely observable part of X(t); 


— X, (£) of size n—r which represents the non-observable part of X(t). 


As shown in the equations developed below (by ignoring the term of control), 
YO) = Ci X (2): 
= Š (£) = Au Yı (t) entirely observable part (size r); 


- X> (t)= A> x, (t)+ Ay» X, (t) non-observable part (size n — r). 


2.5.4. Case of partially controllable and partially observable systems 


If rank QG =q<n and rank Qg =r <n, we show that there is a basis of the 
state space where the system admits the following minimum canonical 
representation: 


— X, (t) represents the controllable , non-observable part of x(t); 


— X, (t) represents the controllable, observable part of X(t); 


— X, (t) represents the non-controllable, non-observable part of x(t); 


— X4(¢) represents the non-controllable, observable part of x(t). 


Ay Ap Ag Aig EN 


T B; 
x(t) + u(t) 
0 


0 än 9 Arg 


Ke 
Il 








o0 0 0 “ae | AIO 


y(t) =| 0 C) 0 Cy IX()+Du(ć) 
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Chapter 3 


Discrete-Time Systems 


3.1. Introduction 


Generally, a signal is a function (or distribution) with support in the time space 
T, and with value in the vector space E, which is defined on R. Depending on 
whether we have a continuous-time signal or a discrete-time signal, the time space 
can be identified with the set of real numbers R or with the set of integers of Z. A 
discrete system is a system which transforms a discrete signal, noted by u, into a 
discrete signal noted by y. The class of systems studied in this chapter is the class of 
time-invariant and linear discrete (DLTI) systems. Such systems can be described by 
the recurrent equations [3.1] or [3.2]!: 


x(k +1) = Ax(k) + Bu(k) [3.1] 
y(k) = Cx(k) + Du(k) | 
y(k) +++-a, Wk —n) = bou(k)+---b,u(k-—n) [3.2] 


where signals u, x and y are sequences with support in Z (ke Z) and with value 
in R”,R" and R? respectively. They represent the input, the state and the output 
of the system (see the notations used in Chapters 2 and 3). A,B,C,D,a;,b; are 
appropriate size matrices with coefficients in R: 





Chapter written by Philippe CHEVREL. 
1 We can show the equivalence of these two types of representations (see Chapter 2). 
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der Be R®™ Ce RP”, De RP” wer? ber [3.3] 


If equations [3.1] and [3.2] can represent intrinsically discrete systems, such as a 
u-processor or certain economic systems, they are, most often, the result of 
discretization of continuous processes. In fact, let us consider the block diagram of 
an automated process, through a computer control (see Figure 3.1). Seen from the 
computer, the process to control, which is supplied with its upstream digital-analog 
and downstream analog-digital converters (ADC), is a discrete system that converts 
the discrete signal u into a discrete signal y. This explains the importance of the 
discrete system theory and its development, which is parallel to the development of 
digital u-computers. 


Ge gua 
0 1 2 3 4 








t t 
p- 0 pe 
k 0 1 2 3 4 k 


u(k) u(t) | Continuous | Ya(t) y(k) 
HO of ET 2 


cu 


Figure 3.1. Computer control 




















This chapter consists of three distinct parts. The analysis and manipulation of 
signals and discrete-time systems are presented in sections 3.2 and 3.3. The 
discretization of continuous-time systems and certain concepts of the sampling 
theory are dealt with in section 3.4. 
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3.2. Discrete signals: analysis and manipulation 
3.2.1. Representation of a discrete signal 


A discrete-time signal? is a function x(.) with support in T = Z and with value in 
E=R",neN. 


We will talk of a scalar signal if n = 1, of a vector signal in the contrary case and 
of a causal signal if x(k) =0, Vke Z . Only causal signals will be considered in 
what follows. There are several ways to describe them: either explicitly, through an 
analytic expression (or by tabulation), like in the case of elementary signals defined 
by equations [3.4] to [3.6], or, implicitly, as a solution of a recurrent equation (see 
equation [3.7]): 


1ifk=0 
Discrete impulse?: Ô(k) = i [3.4] 
0 ifkez 
: : 1 ifke Zt 
Unit-step function: T(k)= [3.5] 
0 ifkeZ ” 
l at ifke Z™ 
Geometrical sequence: g(k) = [3.6] 
0 ifkeZz™ 


It will be easily verified that the solution of equation [3.7] is the geometrical 
sequence [3.6] previously defined. Hence, the geometrical sequence has, for 
discrete-time signals, a role similar to the role of the exponential function for 
continuous-time signals. 


| i x(k +1) =ax(k) 
First order recurrent equation: [3.7] 
x(0) =1 





2 Unlike a continuous-time signal, which is a function with real number support (T = R). 
3 We note that if the continuous-time impulse or Dirac impulse is defined only in the 
distribution sense, it goes differently for the discrete impulse. 
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3.2.2. Delay and lead operators 


The concept of an operator is interesting because it enables a compact 
formulation of the description of signals and systems. The manipulation of 
difference equations especially leads back to a purely algebraic problem. 


We will call “operator’ the formal tool that makes it possible to univocally 
associate with any signal x(-) with support in T another signal y(-), itself with 
support in T. As an example we can mention the “lead” operator, noted by q 
[AST 84]. Defined by equation [3.8], it has a role similar to that of the “derived” 
operator for continuous-time signals. The delay operator is noted by q` for obvious 
reasons (identity operator: 1=q 0 q! ). 





x: TOE gx: TOE x: TOE q'ix: TOE 
S [3.8] > 
k > x(k) k > x(k +1) k > x(k) k— x(k-1) 














Table 3.1. Backwards-forwards shift operators 


Any operator f is called linear if and only if it converts the entire sequence 
x(k) + Axg(k), AER into the sequence y(k)+Ayo(k) with y,=f(x,) and 
y2 =f (%2). 


It is called stationary if it converts any entire delayed or advanced sequence 
x(k-r), rez into the sequence y(k-r), with y=f (x) (formally, 
Sq" x)=" fo). 


The gain of the operator is induced by the standard used in the space of the 
signals considered (for example, Lz or L..). The gain of the lead operator is unitary. 


These definitions will be useful in section 3.3. Except for the lead operator, 


and operator w=(q +1)7! o(q—1) will be used sometimes. 





i 
operator ôr 2 
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3.2.3. z-transform 


3.2.3.1. Definition 

The z-transform represents one of the main tools for the analysis of signals and 
discrete systems. It is the discrete-time counterpart of the Laplace transform. The z- 
transform of the sequence {x(k)}, noted by X(z), is the bound, when it exists, of 
the sequence: X (DES x(k) where z is a variable belonging to the complex 


plan. 


For a causal signal, the z-transform is given by [3.9] and we can define the 
convergence radius R of the sequence (the sequence is assumed to be entirely 
convergent for |z|>R ). 


XD Z{x(k) 1 yah) at  |z|>R [3.9] 
0 


X(z) is the function that generates the numeric sequence {x(k)}. We will easily 
prove the results of Table 3.2. 
































x(k) X(z) R x(k) X(z) R 
(k) 1 co a‘ T(k) 4 a 

Z-a 

in © 

Tk = 1 k sin(@ K(k = 
® z-l ee) 2 —(2acosw)z+ a’ 

2 
kT(k) S| 1 | ak cosmo) — seats > | la 
(=I) z —(2acos@)z+a 


























Table 3.2. Table of transforms 


3.2.3.2. Inverse transform 


The inverse transform of X(z), which is a rational fraction in z, can be obtained 
for the simple forms by simply reading through the table. In more complicated 
cases, a previous decomposition into simple elements is necessary. We can also 
calculate the sequence development of X(z) by polynomial division according to 
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the decreasing powers of z! or apply the method of deviations, starting from the 
definition of the inverse transform: 


x(k) = ZT (X(2)) = a [X@z*az [3.10] 
C 


where C is a circle centered on 0 including the poles of X(z). 


3.2.3.3. Properties of the z-transform* 


We will also show, with no difficulties (as an exercise), the various properties of 
the z-transform that can be found below. The convergence rays of the different 
sequences are mentioned. We note by R, the convergence ray of the sequence 
associated with the causal sequence x(k). 


P1: z-transform is linear (R =max(R,,R,,)) 


ax+by 
Z({ax(k) + by(k)}) = aX(z)+ bY(z), Va,be R 
P2: delay theorem ( kr =R,) 


Ziq’ x=" X(z), Vrez* 


P3: lead theorem ( R p „= R,) 
n-l 
Z({q" x(k)}) = of = dx! | VneZ* 
0 


In particular: Z({x(k + 1)}) = zX(z)- x(0) 
P4: initial value theorem 


If x(k) has X(z) as a transform and if lim X(z) exists, then: 
Z—oo 


x(0) = lim X(z) 


4 Note: the various manipulated signals are assumed to be causal. 
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P5: final value theorem 


If lim x(k) exists, then: lim x(k) = lim (l—z!)X(z) 
k—c k—c zal 


P6: discrete convolution theorem ( Ry :,, 


= max(R y, Ry )) 


Let us consider two causal signals xı (k) and x(k) and their convolution integral 


xı *¥ X(N) = re -k)x (k) = vu (n—k)x>(k). We have: 


k=—00 k=0 


Zu * XQ (M)}) = X1) X22) 
P7: multiplication by k (Ry, = R,) 


Heer 
dz 


P8: multiplication by a* (Ry = |a| R,) 


Z({a* x(k)}) = X(a°!2) 


3.2.3.4. Relations between the Fourier-Laplace transforms and the z-transform 


The aim of this section is not to describe in detail the theory pertaining to the 
Fourier transform. More information on this theory can be found in [ROU 92]. Only 
the definitions are mentioned here, that enable us to make the comparison between 
the various transforms. 














Continuous signal: x (?) Discrete signal: x(k) 
eo +o ; 
Fourier transform Xr(Q)= fx edt | Xp@)= > x(k) e Jak 
as k=—00 
co +co + 
Laplace transform/ Xe [xy (t) e Pd | X(z)= X x(k) z 
z-transform pe k=-00 
pE C zEC 














Table 3.3. Synthesis of the various transforms 
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Hence, if we suppose that X(z) exists for z= el ® , the signal discrete Fourier 
transform x(k) is given by X-(@) = X(e/”), whereas in the continuous case, © is 
a homogenous impulse at a time inverse, the discrete impulse wy (also called 
reduced impulse) is adimensional. The relations between the two transforms will 
become more obvious in section 3.4 where the discrete signal is obtained through 
the sampling of the continuous signal. 


3.3. Discrete systems (DLTI) 


A discrete system is a system that converts an incoming data sequence u(k) into 
an outgoing sequence y(k). Formally, we can assign an operator f that transforms 


the signal u into a signal y (y(k)= f(u)(k), Yke Z). The system is called linear 


if the operator assigned is linear. It is stationary or time-invariant if f is stationary 

(see section 3.2). It is causal if the output at instant k =n depends only on the 

inputs at previous instants k <n. It is called BIBO-stable if for any bound-input 

corresponds a bound-output and this, irrespective of the initial conditions. Formally: 

(sup u(k) < œ= sup (fu)(k)<co). In this chapter we will consider only time- 
k k 


invariant linear discrete systems. Different types of representations can be 
envisaged. 


3.3.1. External representation 


The representation of a system with the help of relations between its only inputs 
and outputs is called external. 


3.3.1.1. Systems defined by a difference equation 


Discrete systems can be described by difference equations, which, for a DLTI 
system, have the form: 


yk) + +++, Wk — n) = bou(k) + -b ulk — n) [3.11] 


We will verify, without difficulty, that such a system is linear and time-invariant 
(see the definition below). The coefficient in y(k) is chosen as unitary in order to 
ensure for the system the property of causality (only the past and present inputs 
affect the output at instant k). The order of the system is the order of the difference 
equation, i.e. the number of past output samples necessary for the calculation of the 
present output sample. From the initial conditions y(—1),---,y(—n), it is easy to 
recursively calculate the output of the system at instant k. 
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3.3.1.2. Representation using the impulse response 


Any signal u(-) can be decomposed into a sum of impulses suitably weighted and 
shifted: 


u(k)= Yuli) 5(k-i) 


[=—0o 


On the other hand, let h(-) be the signal that represents the impulse response of 
the system (formally: h = f(6)). The response of the system to signal q” is q‘h 
due to the property of stationarity. Hence, linearity leads to the following relation: 


y(k) = Sui h(k-i)= SA) u(k—i) =h*u(k) [3.12] 


j=—co j=—co 


The output of the system is expressed thus as the convolution integral of the 
impulse response h and of the input signal u. We can easily show that the system is 
causal if and only if A(k) =0, VA <0. In addition, it is BIBO-stable if and only if 


¥|Ali)|< oo. 
i=0 


3.3.2. Internal representation 


In section 3.3.1.1 we saw that a difference equation of order n would require n 
initial conditions in order to be resolved. In other words, these initial conditions 
characterize the initial state of the system. In general, the instantaneous state 
x(k)e R” sums up the past of the system and makes it possible to predict its future. 
From the point of view of simulation, the size of x(k) is also the number of variables 
to memorize for each iteration. Based on the recurrent equation [3.11], the state vector 
can be constituted from the past input and output samples. For example, let us define 
the i" component of x(k), x;(k), through the relation: 


x(k) =) [bulk - j+i-1)-ajy(k- j+i-D) [3.13] 
j=i 
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Then we verify that the state vector satisfies the recurrent relation of first order 
[3.14a] called equation of state and that the system output is obtained from the 
observation equation [3.14b]: 


-q 1 0 0 bi 
x(k+1) = AE: x(k)+ | u(k) [3.14a] 
0 0 1 : 
-a, 0 0 by 
a a See 
y(k)=(1 0 +: O)x(kK) +b u(k) [3.14b] 
a D 


We note that the iterative calculation of y(k) requires only the initial state 
x(0)2xq (obtained according to [3.13] from C.I. {y(-1),-, y(-n)}) and the past 
input samples {u(i), O<i< k} . As in the continuous case, this state representations 
is defined only for a basis change and the choice of its parameterization is not 
without incidence on the number of calculations to perform. In addition, the 
characterization of structural properties introduced in the context of continuous-time 
systems (see Chapters 2 and 4), such as controllability or observability, are valid 
here. 


The evolution of the system output according to the input applied and initial 
conditions is simply obtained by solving [3.14]: 


k- : 
y(k) = CA* xp + S CA gufi) + Du(k) [3.15] 
y,(k) = 





»,() 


y(k) and yp(k) designate respectively the free response and the forced response of 
the system. Unlike the continuous case, the solution involves a sum, and not an 
integration, of powers of A and not a matrix exponential function. Each component 
x;(k) of the free response can be expressed as a linear combination of terms, such as 
p(k) Ae , where 9;(-) is a polynomial of an order equal ton; —1, where n; is the 
multiplicity order of A; and i" the eigenvalue of A. 


5 A canonical form called controllable companion. 
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Based on the previous definitions, the system is necessarily BIBO-stable if the 
spectral ray of A, 0(A), is lower than the unit (i.e. all values of A are included in the 
unit disc). The other way round is true only if the triplet (A,B,C) is controllable and 
observable, i.e. if the realization considered is minimal. If (A) is strictly lower 
than 1, the system is called asymptotically stable, i.e. it verifies the following 
property: 


u()=0 = lim }x(k)]=0, Vx 
k—+00 


Another way to verify that this property is satisfied is to use Lyapunov’s theory 
for the discrete-time systems. The next result is close to the result for continuous- 
time systems in Chapter 2. 


THEOREM 3.1— the system described by the recurrence x(k+1) = Ax(k), 
x(0) = xo is asymptotically stable if and only if: 


JQ = o7 >0 and 3P = PT >0 solution of equations: ATPA-P= Q 





3.3.3. Representation in terms of operator 


The description and manipulation of systems as well as passing from one type of 
representation to another can be standardized in a compact manner by using the 
concept of operator introduced in section 3.2.2. 


Let us suppose, in order to simplify, that signals u and y as causal. Hence, we 
will be interested only in the evolution of the system starting from zero initial 
conditions. In this case we can identify the manipulations on the systems to 
operations in the body of rational fractions whose variable is an operator. The lead 
operator q and the mutual operator g ! are natural and hence very dispersed. Starting, 
successively, from representations [3.11], [3.12] and [3.15], we obtain expressions 
[3.16], [3.17] and [3.18] of operator H(q) which are characteristic for system 
y(k) = H(qu(k) ): 


=A n 
OS bo +- +b,q 4 bog +--+, 





z J [3.16] 
Ita te-+a,q"  qg"+ag" ++a, 





6 Called a discrete-time Lyupanov equation. N.B.: O>0 << Q is defined positive. 
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+00 _ + ; 
H(g)= hq” =E hiq [3.17] 
j=—00 i=0 
see . . 
H(g)= C(ql- A) 'B+D= D CAT BG" +D [3.18] 7 
i=l 


The relation between the various representations is thus clarified: 
h(0)=b =D, hd) =—-ajbo+b,=CB, =- h(k)= CHB [3.19] 


The use of this formalism makes it possible to reduce the serialization or 
parallelization of two systems to an algebraic manipulation on the associated 
operators. This property is illustrated in Figure 3.2. In general, we can define an 
algebra of diagrams which makes it possible to reduce the complexity of a defined 
system from interconnected sub-systems. 


2 Y> & = > H°H — 


Series connection 
i ee 
Oe S | 
> | He | Parallel connection 
1 
2 














y D > 4 (1 + Hj°H) LH; y 
Retroaction 





Figure 3.2. Interconnected systems 


7 The system is causal. 
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NOTE 3.1.— acknowledging the initial conditions, which is natural in the state 
formalism and more suitable to the requirements of the control engineer, makes 
certain algebraic manipulations illicit. This point is not detailed here but for further 
details see [BOU 94, QUA 99]. 


THEOREM 3.2— a rational SLDI, i.e. that can be described by [3.16], is BIBO- 
stable if and only if one of the following propositions is verified: 


— the poles of the reduced form of H(q) are in modules strictly less than 1; 
— the sequence h(k) is completely convergent; 


— the state matrix A of any minimal realization of H(q) has all its values strictly 
less than I in module. 


These propositions can be extended to the case of a multi-input/multi-output 
DLTI system (see [CHE 99]). 


NOTE 3.2.— the Jury criterion (1962) [JUR 64] makes it possible to obtain the 
stability of the system [3.16] without explicitly calculating the poles of H(q), by 
simple examination of coefficients a,a7,---,a, with the help of Table 3.4. 





ag a a) > Ay ÿ > | 
ş un — d dı dn — a 
an An ag with: bọ = = n h = 170 - n-1 în 
bo bi 0 0 
drag Tay A,_ 
ba- bo p, = L0 n fnk 
a 
Co ĉi 
c = by bo =D Dn-1-k 
n-l Ck = —— 
bo 


Table 3.4. Jury table 


The first row is the simple copy of coefficients of a&,,a2,---,a, , the second row 
reiterates these coefficients inversely, the third row is obtained from the first two by 
calculating in turns the determinant formed by columns 1 and n, 2 and n, etc. (see 
expression of by), the fourth row reiterates the coefficients of the third row in 
inverse order, etc. The system is stable if and only if the first coefficients of the odd 
rows of the table (ag,9,cg, etc.) are all strictly positive. 
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NOTE 3.3.— the class of rational systems that can be described by [3.16] or [3.18] is 
a sub-class of DLTI systems. To be certain of this, let us consider the system 


characterized by the irrational transfer: H(g)=1In(1+ q’) . This DLTI system, 
whose impulse response is zero in 0 and such that A(k) = A for ke Z* cannot be 
descibed by [3.16] or [3.18]. 


The use of lead and delay operators is not universal. Certain motivations that will 
be mentioned in section 3.4 will lead to sometimes prefer other operators [GEV 93, 
MID 90]. 


Use of operator 6, 


-1 ; : : 
Operator ô, = aes , TE R [MID 90] represents an interesting alternative to the 
T 


lead operator. It is easy to pass from parameterized transfer H(q) by coefficients 
{a;, b;, i€ {l,---n}} to parameterized transfer H;(ô) by coefficients 
{as bs, ie {l,---n}}. Then we can work exclusively with this operator and 


obtain, by analogy with [3.14], a realization in the state space of the form: 





-45 1 0 0 bs, 
: 0 0 : 
ô,x(k) = 001 x(k)+ : u(k) [3.20a] 
-aş 0 0 bs, 
—— — 
As Bs 
y(k) =(1 0 > O)x(k)+ bs, u(k) [3.20b] 
— Le 
Cs Ds 


However, the simulation of this system requires a supplementary stage 
consisting of calculating at each iteration x(k +1) = x(k)+ T ô x(k). Finally, from the 
point of view of simulation, the parameterization of the system according to 
matrices As,Bs,Cs5,Ds5 differs from the usual parameterization only by the 
addition of the intermediary variable x(k) in the calculations. We easily 
reciprocally pass to the representation in g by writing: 


an [3.21] 
y 


k) = Csx(k + Dsu(k) 
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Hence, we have the equivalences: 


Ae (I+7 4s) BovBs 


COCs D e Ds [3.22] 


A +00 A : 
H,(q) © H5(6)=Cs(d - As)" Bs + Ds = DC545' 'B3d! + Ds 
i=l 


Combined use of operators y and ô; 


We use, this time together, operator 5,, which was previously defined, and 


T° 





+1. : 
operator y = T7 in order to describe the recurrence: 


as = Ayyx(k)+B,u(k) [3.23] 


y(k) = CAEN D,u(k) 


where matrices’ 4,,B,,C,,D, are linked to matrices 4,B,C,D of the q 
representation based on equation [3.24]. We note that the condition of reversibility 
of matrix (4+/) is required and that this condition is always satisfied if the 
discretized system is the result of the discretization of a continuous system (see 
section 3.4.3). 

d 


A, =2(A-D (At B, =—(A+D "|B [3.24] 
T 


E 
c, =c D, =D 


Representations [3.20] and [3.23] have certain advantages over the q 
representation that will be presented in section 3.4.6. We should underline from now 
that the “y-ô,” representation makes it possible to unify many results of the 
theory of systems traditionally obtained through different paths, depending on 
whether we deal with continuous or discrete-time systems [RAB 00]. In particular, 
the theorem of stability (see Theorem 3.1) is expressed as in continuous-time. 


THEOREM 3.3.— the system described by the recurrence 
O,x(k) = Ayy x(k), yx(0)= zo is asymptotically stable if and only if JO = on >0 
and IP = P" >0 solution of equation®: ATP + PA, = -Q. 


8 Index w is used here in order to establish the relation with the W transform [FRA 92] and 
the operator: w, = y! o ô, [RAB 00]. 
9 We recognize here a Lyapunov continuous-time equation. 
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3.3.4. Transfer function and frequency response 
Let us consider a stable system defined through its impulse response A(.) or by 


+00 : 

operator H(q)= Y h(i)q™ . We assume it is excited through the sinusoidal input of 
i=0 

reduced impulse æg: u(k) = el, 


The output is obtained by applying the theorem of discrete convolution: 


+00 +00 
y(k)= > He = D ai [eamas (age ITAR) 
1=0 1=0 


H(eJ@) 


Hence, the output of a DLTI system excited by a sinusoidal input of impulse ø, 
is a sinusoidal signal of the same impulse. It is amplified by factor 


K( y) = HG )| and phase shifted of angle @(® 7) = arg H(e}° ). Very naturally, 





its static gainl0 g.,, is obtained for w,=0 or in the same way z=1: 


8stq = K(0) = H(1). We note that H(el2) is the discrete Fourier transform of the 
impulse response of the system and it can be obtained (see Table 3.3) from its z- 


transform, H(z) for z= eJ°« We will often talk of transfer function in order to 
arbitrarily designate H(z) or H(q). However, it is important to keep in mind that 


H(q) is an operator, whereas H(z) is a complex number. 


The drawing of module and of phase of H (e 9) according to wy represents the 


Bode diagram of discrete systems. For the discrete case, there are no simple 
asymptotic drawings, which largely limits its use from the point of view of the 


design. In addition, the periodicity of function elu , of period 27 , induces that of 
frequency response H (ef ). This property should not seem mysterious. It simply 


results from the fact that fu (k) =e! (TOR [=1,2,:..} represent in reality 


different ways of writing the same signal. We even speak of “alias” in this case. The 
response of the system to each of these aliases is thus identical. In addition, it is 





10 Ratio between input and output in static state (u(k) = 1= el ,VkEZ). 
@,=0 
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easily proven that module K(w,) = K ela) and phase ọ(@4)= arg H(e!*) are 





respectively even and odd functions of æ . The place of the Bode diagram is to draw 
(by using the PC in practice) only on the interval [0,7]. However, an approximate 
drawing can be obtained by applying the rules of asymptotic drawing presented in 
the context of continuous-time systems (see Chapter 1), by using a rational 


eset j : 1+W 
approximation ofz = e7? . We will use for example the W transform, z «> ; 





1 : : i 
and we will draw H | in the place and instead of H( ee), 
—W : 
W=jQ, 


Gain (dB) 





Phase (degree) 








10-! 109 P” 
Og 


Figure 3.3. Bode diagram 


Let us consider the case of a first order system given by its transfer function 





1- 
H(2) = zat al <1. The bandwidth of the Bode diagram drawn in Figure 3.3 is 
z-a 


more important if a is “small”. This result can be linked to the time response of this 
same system studied in the next section. 
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In addition, the frequency response from the 6 transfers can be written: 


jog jog 
e -1 e -1 
nf 7 ] and po wets] 7 | 


K(@) = 











3.3.5. Time response of basic systems 
A DLTI system of an arbitrarily high order can be decomposed into serialization 
or parallelization of first and second order systems (see Chapter 1). Hence, it is 


interesting to outline the characteristics of these two basic systems. 


3.3.5.1. First order system 





Let us consider the first order system described by: y(k) = u(k). We can 


q-a 
associate with it: 


— the recurrent equation: y(k +1) = ay(k) + bu(k) ; 


— the impulse response: A(k) = bal ke N'and h(0) =0; 


— the unit-step response!!: y(x) = —— = a‘), keN. 
=ü 


3.3.5.2. Second order system 


Let us consider the second order system described by: y(k) = Suunn 


u(k). 
q +ag+a 
— m 

H(q) 


We can associate with it the recurrent equation: 


y(k + 2) =—-ayy(k +1) -a,y(k) + biu(k +1) + bou(k) 





11 It can be obtained either from the recurrent equation or by inverse z-transform of: 


Y(z)= ro , with T(z) = = $ 
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In addition, if we note by pg, and by py2 the poles and zy, the zero of H(q), 
we have, if the poles are conjugated complex numbers pq, = pe!’ and 
= pe I?i th if- : 
Pdi = pe , the unit-step response: 


y(k) = H(D +a p* sin(wyk +ø) 


E 
bi Jeans a) 


with: œ = — a 
pP Ssn @ p^ -2pcosw+l 





More generally, and according to the situation of the poles, there are various 
types of unit-step responses (see Figure 3.4), which are stable or unstable depending 
on whether the poles belong or not to the unit disc. 








1.5 06e 0. 
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Figure 3.4. Relation between the poles and the second order unit-step response 


3.4. Discretization of continuous-time systems 


The diagram in Figure 3.1, process excluded, represents a typical chain of digital 
processing of the signal that traditionally proceeds in several stages. 


The ADC periodically retrieves the values of the analog signal y(t) at the 
instants tų = kT . It returns the discrete signal y(k) consisting of successive samples 
of y,(t). This sampling operation can be standardized by the identity: 
y(k) = y, (kT). In reality, the ADC also carries out the digitization of the signal 
(the digitized signal can have only a finite number of values). We will not discuss in 
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what follows these quantification errors for simplicity reasons and we will note by 
E, the sampling operator: y = E_(a)- 


The computer is the processing unit that obtains signal u(k) from signals y(k) 
and r(k). The system is discrete. 


The ADC converts the discrete time signal u(k) into the analog signal u,(t). 
Several types of blockers can be considered but currently the most widespread is the 
0 order blocker, to which we will associate operator Bọ , that maintains constant the 
sample value for a sampling period (see Figure 3.1): u,(kT +7) =u(k), Vke Z, 
Vre[0,7[. 


An alternative that will not be considered here consists of using a so-called “first 
order” blocker, operating a linear extrapolation of the input signal from the 2 last 
samples: 


Ug (kT + D) = u(k)+ ae VkeZ, Yre [0,71 [3.25] 


Another point of view is to consider the discretized process of input u(k) and 
output y(k). It is a discrete system whose model can be obtained from the 
continuous-time model. The way it is obtained as well as its properties are at the 
heart of this section. The study of conditions under which we can reconstitute an 
analog signal from its samples (sampling theory) as well as the analysis of problems 
specific to the computerized control will not be discussed here. However, for more 
information, see [PIC 77, ROU 82]. 


The ADC is supposed to retrieve samples periodically. We note by 7 the 
sampling period. We also consider that ADC and DAC are synchronized. 


3.4.1. Discretization of analog signals 


We have previously defined the sampling operation by x = E i) by making 
the continuous-time signal x,(f) correspond to the discrete-time signal x(k). We 
can define the Laplace transform of x(t) and the z-transform of x(k) and pass 
directly from the first one to the second one due to Table 3.5 and operator Z: 


is Er Z 
Xal P) > XO x(k) > Xz) [3.26] 


Z 
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The inverse operation is possible only by issuing hypotheses!? on the original 
signal. Otherwise, the relation is not bijective and several analog signals can lead to 
the same discrete signal. In this case we talk of alias (sin @yk and sin(@, +27)k 
are two aliases of a same signal). 


























x(k) = x, (kT) 
X x(t), t20 a X(z 
a(P) a(t) a (2) 
l z 
p Fa) T(k) z 
: et e ZT Z 
pta z-e °T 
00) -a kT .- 7 
(p+a)? +02 e * t sin wt e sın OkT we aT S Og 
+0) +0 m = — 
s PL ate tT cosogz +e AT 
+O —& kT x 
( To? a e! coset | © SOSE z(z-e™®™T cosy) 
a oO k — — 
i 2d 2 ont cos@gz +e PE 




















Table 3.5. Table of transforms 


3.4.2. Transfer function of the discretized system 


Let (2) be the continuous system defined by transfer H(s) where s is the 
derivation operator. Let (2,,) be the discrete system obtained from (2) by adding a 
downstream sampler and an upstream 0 order blocker, like in Figure 3.1 (if (2) 
designates the continuous process and (2) the discretized process). If the two 
converters are synchronized and at a pace equal to the sampling period 7, we obtain 
for (27) the transfer Hr (q) = Er ° H(s)° By. We obtain without difficulty!3, from 
the relation y(k) = H7(q)u(k), the following relation: 


[3.27] 


Y(z)= Hr(z)U(z), with: Hr(z)=(1- zt) Z (#2) 


P 


12 Shannon condition. 
13 At input we apply a discrete impulse which will then successively undergo the various 
transformations, h(-) = (Er o H(s)° Bo o 0)(-), then we obtain Hp (z) from the z-transform of 


this impulse response. 
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Due to this relation and Table 3.5, we will be able to easily obtain the transfer 
function of the discretized system and, consequently, its frequency response. It is 
also shown (see next section) that if p, is a pole of the continuous system, then 
Pa =e’, whereas the poles of the discretized system {pg i=1,---n} are 
obtained from the poles of the continuous system {p,;, i=1,---n} using the relation: 


eagle Visis 
Pai =e Vi=l-n [3.28] 


Consequently, the stability of the continuous system (Re (p,;)< 0) leads to the 
stability of the discretized system ( | Pail <). 


3.4.3. State representation of the discretized system 


Let us consider this time the continuous system (2) which is described by the 
state representation: 


i (t) = A x, (0) + Bua (t) [3.29] 


Yalt) = Cxa (t) + Du, (t) 


Let u(k) be the input sequence of the discretized model. It is transformed into 
the constant analog signal u,(t) fragmented by the 0 order blocker before 
“attacking” the continuous system. We try to express the relation between u(k) and 
the sampled output and state vectors x(k)=x,(kT) and y(k)=y,(kT). We have, 
between the sampling instants tą =AT and tp} =(k+1)T, ug(t)=u(k) and 
consequently uw,(t)=0. Equation [3.29] is then rewritten between these two 
instants: 


) [3.30] 
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Based on the solution of this differential equation, we obtain: 





A 0 
patin) do o)" feto) fu o ea 
Ug (tha4) Ug (ty) 0 I Uy (ty) [3.31] 
= Xq (th) 
Yatk)=(C D) 4 a 


Finally, we can associate the state representation with the discretized system: 


Ar =e“? = 1+ AYT 





k+1)=A k)+ Bru(k 
i JEA palk) HBri) i: Br = BYT [3.32]!4 
y(k) = Cx(k) + Du(k) Aro Gop 
P=l+—+ 
2! 3! 


We obviously have, for H(s) = C(sI ~A)'B+ D, the transfer of the discrete 
system given by H7(q)=C(ql- Ar J Br + D and we find again relation [3.28] 
because the poles of H(p) and H7(z) are also the eigenvalues of matrices A and 
Ar, if we suppose the minimal realizations. 


3.4.4. Frequency responses of the continuous and discrete system 


The frequency responses of the continuous system and its discretization are 
given by H(jm) and He”). We can also mention here that if the impulse œ is 
expressed in rad/TU!>, the discrete impulse @g =@T is without size. The two 
frequency responses are very similar in low frequency, i.e. if @T <<. They are 
necessarily different in high frequency, since the frequency response of the 
discretized system is periodic, contrary to the one of the continuous system. 


14 If A is non-singular, we also have Br = Ate — DB. 
15 TU: Time Unit. 
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Figure 3.5. Bode diagram of the continuous system and its discretization 


3.4.5. The problem of sub-sampling 


Let us consider the case of a “standardized” pendulum subjected to a torque u. In 
2 


the absence of friction, it will be controlled by equation 0 +0 =u. 
dt 


d0 . : 
If we choose x” = o ; a as state vector, we obtain the state representation: 
t 


0 1 0 
x+| fu 
E A () . We easily verify that this system is controllable and 


y=( 0) 
observable. The discretized system at time T is controlled by: 


cosT sinT 1-cosT 
xX(tk+1) = | bel re Jeo 


-sinT cosT 
y=()=( 0)x() 
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It is also controllable and observable except if T =m, which corresponds to 
drawing a sample every half-period oscillation of the pendulum. This pace is 
obviously insufficient. In this case we talk of sub-sampling. 


If we change the perspective angle and if we use the formalism of transfer 
functions, the loss of controllability or observability illustrated above translates into 
a deterioration in the order of the discretized transfer. Let us illustrate this point with 
the help of the following example where H7(z) designates the discretized transfer 
obtained by transfer discretization H(p) with the sampling period T. 


2 2 
a +O Az+B 
H(p)= ze Hi aT 
(p+ a) +w z —2e % coswyz+e 
o O -aT 
A=1-e %T cosøoT -—e"°T sinoT 
; a 
with: 
= O -aT … = 
Boe? 4e OT sin oT -e 27 cosoT 
a 
i [ee 
We obtain for wT = 7 the first order transfer H re) ———> . We note that 
z+e 


this transfer has a negative real pole and that its impulse response is an alternated 
sequence that converges towards 0 if œ > 0. Such a behavior for a first order system 
does not have a continuous time equivalent. 


3.4.6. The problem of over-sampling 


We talk of over-sampling if the sampling period is “very small” with respect to 


the dominating pole Pe, of the continuous system, or even when |r Pe,|<<!. The 





. : Tp on. Sates tasty 
pole corresponding to the discrete system py, =e Peo ig in this case very close to 


the unit. Hence, it is important to have a high numeric precision for the value of this 
pole because if not the discretized system will be considered wrong with respect to 
stability. For 7 <<1, the state matrices (see equations [3.32]) are such that 
Aa -1 | <<1 and Ba | << 1. This point is illustrated by Example 3.1. 
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EXAMPLE 3.1. let us consider the continuous system: 


4 (t)=-@xq(+ aug (t) 6) 
Va (6) =Xq (£) 


We can obtain the state representation of the discretized system of equations 
[3.32]. For a sampling period T = 10 $s and ge {1,2}, we obtain a very good 
approximation: 


ue = (1-ax10°) x(6)+ax10 uk) G,) 
VK) = x(k) 


a =2 represents a system “twice as fast” as the case œ =1 (the time constant of 
the continuous system is 7 = | g). For all that, the coefficients of the realizations 


obtained by discretization differ only by their sixth decimal in the two cases 
(4q =1, By =0)- 


Such an over-sampling (aT ~10~°) is a source of numeric error unless there is 
high precision in the storing of coefficients. However, we note that this problem is 
not intrinsic and depends on the choice of the operator enabling the description of 
the discrete system. (2,7) can alternatively be described with the help of the 
operator dr or O7—Y (see section 3.3.3). From equations [3.32] and [3.22] we 
obtain here with high precision: 


Opx(k) = -ax(k)+au(k) (24) 
and with even higher precision (equation [3.24]): 
Opx(k) = -æ x(k)+au(k) (24) 


Even if very simple, this example makes it possible to appreciate the interest in 
choosing these operators in order to prevent the risks of numeric errors pertaining to 
over-sampling. The coefficients of the discrete model have the same order of 
magnitude as the ones of the continuous model and this for an arbitrarily small 
sampling period. For a more detailed analysis, see [GEV 93, MID 90, SWI 98]. 


Finally, for the simulation algorithm to be complete, x(k +1) = x(k)+Tô,x(k) or 
x(k+1)= Vx(k)+ a ôrx(k) should be calculated at each iteration. 
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3.5. Conclusion 


In this chapter, we introduced the basic concepts that make it possible to 
understand the discrete-time signals and systems. We saw that the z-transform has in 
the case of discrete-time a role similar to the Laplace continuous-time transform and 
that the lead operator was substituted to the derivation operator. The different 
concepts introduced in the previous chapters in terms of representation or structural 
properties of the systems could then be transposed without difficulty for the case of 
discrete-time systems. 


After briefly analyzing the behavior of basic discrete systems, we presented in 
short the issue of sampling passage from continuous-time signals and systems to 
discrete-time signals and systems. Our goal was to provide the basics that will make 
it possible to deal with (Chapters 8, 12 and 13) the digital simulation of continuous 
systems and their control by the computer. We deliberately ignored certain results, 
however essential, in signal theory but which were not strictly necessary in the 
context of this work. 
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Chapter 4 


Structural Properties of Linear Systems 


4.1. Introduction: basic tools for a structural analysis of systems 


Any physical system has limitations in spite of the various possible control 
actions meant to improve its dynamic behavior. Some structural constraints may 
appear very early during the analysis phases. The following example illustrates the 
importance of the location of zeros with respect to the solution of a traditional 
control problem which is the pursuit of model, by dynamic pre-compensation. Being 
given a transfer procedure equal to: 


p- 


(De 
(p+) 


is it possible to find a compensator c(p), so that the compensated procedure has a 
transfer equal to the one of the model previously fixed, #,,(p) ? It is well known that 
the model to pursue cannot be chosen entirely freely. Indeed, the pursuit equation 
t(p)c(p) =tm(p) imposes that the model must have the same unstable zero as the 
procedure, otherwise the compensator will have to simplify it and hence an internal 
instability will occur. In addition, the relative degree of the model (the degree of 
difference between denominator and numerator; we will refer to it later on as the 
infinite zero order) cannot be lower than 2, otherwise the compensator will not be 
appropriate. 
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The object of this chapter is to describe certain structural properties of linear 
systems that condition the resolution of numerous control problems. The plan is the 
following. 


After a brief description of certain main geometric and polynomial tools, useful 
for a structural analysis of the systems (section 4.1), we will describe the Kronecker 
canonical form of a matrix pencil, which, when we particularize it to different 
pencils (input-state, state-output and input-state-output) gives us directly, but with a 
common perspective, the controllable and observable canonical forms (of 
Brunovsky) and the canonical form of Morse (section 4.2). The following section 
(section 4.3) illustrates the invariance properties of the various structures of these 
canonical forms (indices of controllability, of observability, finite and infinite zeros) 
and of the associated transformation groups (basis changes, state returns, output 
injections). Two “traditional” control problems are considered (disturbance rejection 
and diagonal decoupling) and the fundamental role played by certain structures 
(invariant infinite and finite zeros, especially the unstable ones) is illustrated with 
respect to the existence of solutions, the existence of stabilizing solutions and 
flexibilities offered in terms of poles positions (concept of fixed poles). This is 
illustrated in section 4.4. Section 4.5 enumerates a few conclusions and lists the 
main references. 


4.1.1. Vector spaces, linear applications 
Let X and y be real vector spaces of finite dimension and VE ¥ and WC Y, two 


sub-spaces. Let L:  — J be a linear application. LY designates the image of V by 
Land LW designates the reverse image of W by L: 


LY: ={ye Y such that dre V and Lx = y} [4.1] 


LW: = {xe X such that Lxe W} [4.2] 


With this notation, imageImL and core KerL of L can also be written: 
ImL = LX and KerL = L '{0}. Naturally, the notation chosen for the reverse image 
should not lead to the impression that L would be necessarily reversible. 
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1 0 
EXAMPLE 4.1.— let us suppose that L = p à , and W is the main straight line 


Naa L {0} = KerL be 
0 1 


Let V be a basis matrix of V and W‘ a basis of the canceller at the left of W (i.e. 
a maximal solution of equation W'W = {0}), a basis of LY is obtained by directly 
preserving only the independent columns of LV. A basis of L~!W is obtained by 
calculating a basis of core Ker(W‘L). 








4.1.2. Invariant sub-spaces 


Let A: X — X be an endomorphism (linear application of a space within itself). 
Let n be the size of 4. A sub-space VC X is called A-invariant if and only if A 
VCY. This concept is adapted to the study of trajectories of an autonomous 
dynamic system, which is described in continuous-time or discrete-time by: 


x(t) = A x(t) or x(k +1) = Ax(k) [4.3] 


Indeed, any state trajectory initiated in an A-invariant V sub-space remains 
indefinitely in V. A-invariant sub-spaces form a closed family for the addition and 
intersection of sub-spaces (the sum and intersection of two A-invariant sub-spaces 
are A-invariant). Consequently, for any LC X sub-space, there is a bigger A- 
invariant (unique) sub-space included in £, noted by £ , and a smaller A-invariant 
(unique) sub-space containing £, noted by £, obtained as the bound of algorithms 
[4.4] and [4.5]: 








Lx, L'=L, L7=LOA'L,, L= Loa sa L"=L [44] 
Ly ={0} G =L, G =L+ AL, La =LtAG, > L, = Le [4.5] 


The concept of A-invariant sub-space also makes it possible to decompose the 
dynamics of an autonomous system of the type [4.3] into two parts, and to describe 
what happens inside and “outside” sub-space V. If we choose as first vectors of a 
basis of ¥ the vectors obtained from a basis of V and if we complete this partial 
basis, the property of A-invariance of V is translated through a zero block in the 
matrix representing A in this basis: 
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A A 
A= l y 12 | [4.6] 
0 Axy 


where A, represents the restriction of A to V and A yy represents the complementary 
dynamics (more rigorously this is a representative matrix for the application in 
quotient 4/V!). 


For controlled dynamic systems, where and U designate, respectively, the state 
space and the control space described by: 


X()=Ax(t)+Bu(t) or x(k +1) = Ax(k) +B u(k) [4.7] 


the (A,B)-invariance characterizes the property of having the capability to force 
trajectories to remain in a given sub-space, due to a suitable choice of the control 
law. A sub-space V of ¥ is (A,B)-invariant if and only if AVC VY + ImB. Similarly, 
Vis (A,B)-invariant if and only if there is a state return (non-unique): F: ¥ — U 
such that (A+BF)V c V. The sum of the two (A,B)-invariant sub-spaces is (A,B)- 
invariant, but this is not true for the intersection. For any sub-space £ C V there is a 
bigger (A,B)-invariant (unique) sub-space included in £ and noted by V (A,B,C). It 
can be calculated as the bound of the non-increasing algorithm [4.8]: 


vo =a, VIEL, VÄLA ImB)>V"=V"(ABL) [48] 


For the analyzed dynamic systems, where 4 and Yy designate the state space and 
the observation space, and described by: 


X(t) = A x(t) x(k +1) = Ax(k) 


[4.9] 
y(t) =C x(t) or y(k) = Cx(k) 


The (C,A)-invariance is a dual property of the (A,B)-invariance and is linked to 
the use of output injection. A sub-space S of # is (C, A)-invariant if and only if 
there is an output injection (non-unique) K: Y — ¥ such that (A+KO)S CS. 
Similarly, S is (C,A)-invariant if and only if A(S © KerC) c S . The intersection of 
two (C,A)-invariant sub-spaces is (C, A)-invariant, but this is not true for the sum. 
For any £ C ¥ sub-space, there is a smaller (C, A)-invariant (unique) sub-space 





1 Given V C X, the quotient X/V represents the set of equivalence classes for the relation of 
equivalence Æ defined on ¥ by VxeX, Vye: xRy & x-y € V. We can visualize 
(abusively) X/V as the set of vectors of X that are outside of V. 
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containing £ and noted by S,(C,A, £). It can be calculated as the bound of the 
following non-decreasing algorithm: 


So ={0}, Sı =L, S41 =L+A(GA KerC) > S, = &(C,A,L) 
[4.10] 


4.1.3. Polynomials, polynomial matrices 


A polynomial matrix is a polynomial whose coefficients are matrices, or, 
similarly, a matrix whose elements are polynomials, for example: 


0 1 1 2 1 -1 2 = 
p?+ p+ ul prl p°+2p-l1 [4.11] 
0 0 1 0 0 1 p 1 


A polynomial matrix is called unimodular if it is square, reversible and 
polynomial reverse. A square polynomial matrix is unimodular if and only if its 
determinant is a non-zero scalar. 


1 1 - 
For example: | is unimodular, its reverse being equal to f 4 . 


In the study of structural properties of a given dynamic system of the following 
type (with n xn A matrix): 


ilO) = A x(t)+B u(r) x(k +1) = Ax(k) +B u(k) 


[4.12] 
y(t) =C x(t) or y(k) = Cx(k) 


intervene several polynomial matrices with an unknown factor p. The best known is 
certainly the [pI-A] characteristic matrix that makes it possible to extract 
information on the poles. Other polynomial matrices make it possible to characterize 
properties such as controllability/obtainability, observability/detectability, or 
concepts grouping together state, control and output, especially in relation to the 
zeros of the system. These are, respectively, the matrices: 


[pI-A -B], ea and ea H [4.13] 
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All these polynomial matrices, which only make the two monomials in p° and p' 
appear, are called matrix pencils. All have the form [pE-H], with E and H not 
necessarily square or of full rank. Two pencils, formed by matrices of the same size, 
[pE-H] and [pE’-H’], are said to be equivalent in the Kronecker sense if and only if 
there are two reversible constant matrices P and Q such that [pE’-H’] = P [pE-H] 
Q. P and Q are the basis changes in the departure space ¥ and in the arrival space V. 


We will analyze, with the help of these matrix pencils, several structural 
properties of systems [4.12]. This will be done progressively in our work, from the 
simplest (pole beams) to the most complete (system matrix). 


4.1.4. Smith form, companion form, Jordan form 


The poles of system [4.12] are given by the eigenvalues of A (see Chapter 2). It 
is well known that these eigenvalues are linked to the dynamic operator A and not 
only to certain of its matrix representations. More precisely, the eigenvalues of A are 
not changed if we replace A by A’ = T'AT, where T designates any basis change 
matrix in X. When such a relation is satisfied, we say that A and A’ are equivalent. 
This relation is also written T'[pI-A]T = [pI-A’] and thus A and A’ are equivalent 
matrices if and only if the beams [pI-A] and [pI-A’] are equivalent in the Kronecker 
sense. An important interest in any equivalence notion, besides the division into 
separate equivalence classes that it induces on the set considered, is to represent 
each class by a particular element, called canonical form. In the case of [pI-A] type 
beams, we know well the companion form type canonical forms (see Chapter 2) or 
Jordan form. These forms are in fact obtained directly from the famous Smith form 
which is developed for the general polynomial matrices. In practice, it is quite easy 
to show from Binet-Cauchy formulae that, for any given size k, two equivalent 
beams [pI-A] and [pI-A’] have the same HCF (the highest common factor) of all the 
non-zero minors of order k. Let us note by o(p), Gp)... On(p) these different 
HCFs for k = 1 to n. Polynomials o(p) can be divided ascendantly (c,(p) divides 
œ(p) which divides 03(p)...). 


Let us introduce the following quotients: 6,(p) = 0,(p), Bb) = wpap), «..; 
BE) = pop). Polynomials A(p) can be divided ascendantly as well. 
Polynomials G(p) which are different from 1 are called invariant polynomials of 
[pI-A] (or of A). The last one (the highest degree one) is the minimal polynomial of 
A (it is the smallest degree polynomial which cancels A). The product of all A(p) is 
Q,(p), which is characteristic polynomial of A. The Smith form of [pI-A] is the 
diagonal of G(p). The invariant polynomials can be written in an extended form, or 
in a factorized form where the n eigenvalues of A appear (certain powers /; being 
then equal to 0): 
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k,-l k; by li; l 
ilp) =aiotanpt. tagpap +p" =p- po)" (p- pi)" .(p- pn)" [4.14] 


From the point of view of terminology, the p; singularities are called eigenvalues of 
A, (internal) poles of the dynamic system [4.12] and zeros of the beam [pI-A]. The 
companion form of A contains as many diagonal blocks as A (p) which are different 
from 1 and for each block, of size k; x k; all terms are zero except for the over- 
diagonal which is full of “1” and the last line consisting of coefficients —a; of (p). 
The Jordan form of A contains, for each eigenvalue p; as many blocks as £{p) having 
a factor (p-p;) ÿ. Each basic block of this type, of size l; x lj has all its terms zero 


cc, 99 


except for the diagonal which is full of “p?” and the over-diagonal which is full of “1”. 


Polynomials (p) are called invariant polynomials of A. The factors of these 
polynomials, i.e. (p-p;) lij are the invariant factors of A. The set of all Xp), as well as 
the set of all invariant factors, form complete invariants under the relation of 
equivalence, i.e. under the action of basis changes (meaning that two square matrices 
of the same size are equivalent if and only if they have exactly the same invariant 
polynomials). 


4.1.5. Notes and references 


The basic tools for the “geometric” approach of automatic control engineering 
(invariant sub-spaces) were introduced by Wonham, Morse, Basile and Marro at the 
beginning of the 1970s; in particular see [BAS 92, WON 85], as well as [TRE 01]. 
Numerous complements on the “polynomial” tools leading to Smith, Jordan or 
companion forms can be found in [GAN 66], as well as in [WIL 65], which is an 
almost incontrovertible work for everything relative to eigenvalues. 


4.2. Beams, canonical forms and invariants 


The pole beam associated with the dynamic system [4.12] is a [pE-H] type 
beam, but with the two following particularities: E and H are square and E is 
reversible. Before considering the general case, we will transitorily suppose E and H 
as square, but E as not systematically reversible. This extension should be brought 
closer to the more general class of implicit systems called regular, i.e. the systems 
described by: 


Jx(t) = A x(t) + B u(t) Jx(k +1) = Ax(k) +B u(k) 


y(t) =C x(t) or y(k) = Cx(k) 
[4.15] 
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with J not forcibly reversible, but [pJ-A] “regular?”, i.e. with a rank equal to n. In 
the case of continuous-time systems, such models particularly make it possible to 
manipulate the differentiators. For example, the following system describes a pure 
differentiator: 


k NE po? Jac ()=[1 0x 
00 QUI gor es 


[4.16] 


It has indeed for transfer C(pJ-A) 'B = p. This system has a pole infinity of order 
1. 


A [pE-H] type regular square beam, with E and H as linear applications of 4 
toward Æ and two isomorphic spaces of size n, will also have finite and infinite 
zeros. Among the most compact methods to illustrate these finite and infinite zeros 
of [pE-H], we can use the Weierstrass canonical form. We easily can, by using the 
basis changes in ¥ and in Æ, which are P and Q respectively, transform the 
departure beam into its Weierstrass canonical form. It is a diagonal form with two 
main blocks separating the infinite zeros from the finite zeros: 


pN-I 


PIPE-H1Q | ere 


| with N nilpotent [4.17] 


Hence, the structure of infinite zeros of [pE-H] is given by the Jordan structure 
of N (in zero because N has only zero eigenvalues). To better understand the fact 
that the singularities in “0” of N represent infinite singularities for the beam, it is 
sufficient to write pN-I = p(1/pI-N). In addition, the structure of finite zeros of 
[pE-H] is given by the structure of [pI-M], as in section 4.1.4. For example, the 
Weierstrass form of a generalized pole beam for a [4.15] type system with two 
infinite poles, one of order 1 and the other of order 2, and two finite poles, in p = —1 
and p = 0 respectively is given by: 


a0 0 0 
—1 p 0 
A a o veg peer P [p41], d=[p] 
pE-H]= with a= -1 pj, b= , c=[ptl], d=[p 
0 0c 0 0 -l1 
0 0 -1 
000d 





2 Le. det(pJ-A) is not identically zero. 
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A way to obtain the Weierstrass form described in [4.17] is to use the following 
algorithms, which are very similar to algorithms [4.5] and [4.4]: 


A= (0}, ANSE HAI > A2 Aj= EHA] [4.18] 

AS= x, AS'= HBAS, > A= A= HEA; [4.19] 
The regularity of the beam [pE-H] can be translated: 

A[® A5=X, ie. AÏ+A5= * and Ain A5= {0} [4.20] 


EA®HA;= X, ie. EA;+HA5=X and EAN HA5= {0} [4.21] 
This leads quite naturally to the following choice for P and Q: 


Q= | basis of Aj basis of A5 |, P= | basis of EA|| basis of HA; | 
[4.22] 


4.2.1. Matrix pencils and geometry 


In the general case, [pE-H] is a rectangular beam, with no particular hypothesis 
of rank, either on E or on H. This means that apart from the previously defined finite 
and infinite zeros, [pE-H] also has a non-trivial core and co-core. Polynomial 
vectors and co-vectors, x(p) and x'(p) then exist such that: [pE-H] x(p) = 0 and/or 
x'(p) [pE-H] = 0. The various possible solutions of these equations are in fact 
classified and ordered in terms of degrees. If x(p) is in the core of [pE-H], the vector 
obtained by multiplying each component of x(p) by a same polynomial is also in the 
core. Hence, we will consider the lowest degree solutions possible. For example, for 
a beam described by: 


p 1 0 
E-H]= 
i la p à) 
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a core basis vector of minimal degree can be described by [1 -p p’]', where “T” 
represents the transposition. Similarly, for a beam described by: 


ae JP 
[pE-H] H 


a co-core basis vector and of minimal degree can be described by [1 -p]. 


Then, through a reduction procedure with respect to these first solutions, we 
consider the following solutions of superior degree, but the lowest one possible, and 
so on. The result is that only the sequence of successive degrees is essential in order 
to properly describe the core and co-core in a canonical form. 


In order to describe the complete structure of a beam in its most general form, 
algorithms [4.18] and [4.19] are sufficient. An important difference with respect to 
the previous regular case is that, in general: 


Ar A3 # {0} when the core is # {0} and 


EA + HA, # X when the co-core is # {0} 


This geometric description is provided in the following section. 


4.2.2. Kronecker’s canonical form 
The main result for “any” beam is the following. 


Two beams [pE-H] and [pE’-H’] are equivalent in Kronecker’s sense, i.e. there 
are basis change matrices P and Q such that [pE’-H’] = P [pE-H] Q, if and only if 
[pE-H] and [pE’-H’] have the same Kronecker’s canonical form. 


Kronecker’s canonical form of a beam [pE-H] is a beam characterized only from 
E and H. This form can possibly contain identically zero columns and/or rows (this 
happens when in the core and/or the co-core there are constant vectors) and in 
addition it has a block-diagonal structure with four types of blocks: 


—finite elementary divisor blocks (also called finite zeros): these are (for 
example) Jordan blocks, of size kj; x kj, associated with (p-a;) kij type monomials. 
(We can also choose companion type blocks.) For example: 


for the monomial (p + 1), etc. [4.23] 
0 pti 
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— minimal index blocks per non-zero columns: these are rectangular blocks, of 
size EX (€;+ 1), having the form: 


p 1 0 
lp 1]fore=1, for € = 2, etc. [4.24] 
0 1 


P 


— minimal index blocks per non-zero rows: these are rectangular blocks, of size 
(mı + 1) x ni, which are identical to minimal index blocks per columns, but simply 
transposed, thus: 


H forn=1, etc. [4.25] 


— infinite elementary divisor blocks (also called infinite zeros): these are square 
blocks, of size v;xv;, with a diagonal full of “1” and an over-diagonal full of “p”, i.e. 
having the form: 


1 p 
[1] forv =1, 01 for v = 2, etc. [4.26] 


Kronecker’s canonical form is fully characterized by the list of polynomials 
(p-a;) “ij and by the three lists of integers {ei}, {ni} and {vi}. These four lists form 
full invariants for the beams under the action of basis changes in the departure and 
arrival spaces. An example of Kronecker’s canonical form (the index “K” is used to 
indicate that the beam is in its Kronecker’s canonical form) is given below, 
corresponding to the list of invariants: {(p-a;) Sj} = {p-3}, {e} = {2}, {ni} = {1} 
and {vi} = {2}: 


a 0 0 0 

0 b 0 0! , p 1 0 p 1 p 
Ep -H,]- with a=[p-3], b= a be , d= 
PEK AKI I) 6 co aN etl H a 

000d 


[4.27] 


Now, due to the two algorithms [4.18] and [4.19], we can provide the geometric 
characteristics of these invariants. For this, we will use the following notations: 
given a list of positive integers {n;}, J = 1 to /, ordered in a non-increasing manner 
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(i.e. n;2n;1), we associate with it the list {p;} which is defined by p;= card{n; 2 j}, 
where “card” represents the cardinal number, i.e. the total number of elements in the 
group. We note that the correspondence between the two lists {n;}, i = 1 to Z and 
{pj}, J = 1 to A is a bijection. Indeed, it is easy to verify that list {n;} also satisfies 
n;= card {p; 2 i} and consequently / = p; and A = n;. 


The geometric characteristics of Kronecker’s invariants are given below. We 
note at this level that these characteristics establish the invariance of the four lists 
under the action of P and Q basis changes in the departure and arrival spaces. 
Indeed, the sizes of intermediary sub-spaces are clearly invariant when we replace E 
and H by PEQ and PHQ: 


— minimal indices per columns: 

Yu 21, card {£; > u}= dim (A530 A #1) dim (450 A #) [4.28] 
— minimal indices per rows: 

VA>1, card (9, > u}= dim (Aj+AS')-dim (Aj+A4) [4.29] 
— infinite elementary divisors: 

Vu21, card {v; > u}= dim (45+.4#)- dim (45+ AL) [4.30] 


— finite elementary divisors. From the definitions of algorithms [4.18] and [4.19] 
it is easy to verify that, not only: 


HA;cEA;, but also: H(A30.A}) CE(A50A}) 


In addition: dim (A5)-dim (A 5N AĮ) = dim (EA3)-dim (E(A50 Ai). 


The finite elementary divisors of the beam [pE-H] are then given by the finite 
elementary divisors (in the sense of Smith’s form; see section 4.1.4) of the next 
square operator, double restriction of H to two quotient spaces (in the departure and 
arrival spaces): 


H: AA NA EA3/E(A50A)) [4.31] 
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These general results on “any” beam will be now focused on some interesting 
cases that will differently clarify certain structural properties of [4.12] type systems. 


4.2.3. Controllable, observable canonical form (Brunovsky) 


Let us go back a little to the controlled dynamic systems without output equation, 
with ¥ and U representing the state space and the control space. In order not to have 
to distinguish controllability and obtainability, we will limit ourselves here to 
continuous-time spaces, as described in [4.7]: 


(t) =A x(t) +B u(t) [4.32] 


We can “naturally” associate with this system the controllability beam [pI-A - 
B], i.e. for which E = [I 0] and H = [A Bj. Due to the subjectivity of E, 
Kronecker’s form of the controllability beam can have only two types of invariants, 
i.e. minimal indices per columns and finite elementary divisors (indeed, for the other 
types of blocks see [4.25] and [4.26], the block sub-matrix in E is not of full rank 
per row and hence it cannot be a part of the global subjective E). These invariants 
have a tighter connection with more traditional concepts, such as the controllability 
indices and the non-controllable poles. More exactly, we can easily show that the 
minimal indices per columns of the controllability beam are exactly equal to the 
controllability indices of the pair (A, B). The finite elementary divisors of the 
controllability beam correspond exactly to the non-controllable dynamics (with 
multiplicities considered through the invariant factors) of the pair (A, B). This will 
be mentioned in section 4.3. Before, we will characterize the group of 
transformations acting on the dynamic system [4.32] and that is equivalent to the 
group of basis changes on the left and right on [pI-A -B]. 


“Kronecker’s” group of transformations acting on the controllability beam 
[pI-A -B] corresponds identically to the “feedback” group acting on the pair (A, B), 
in other words formulated: 


JP & Q reversible such that: P[pl-A -B]Q=[pI-A' -B'| 


© AT & G reversible & JF such that: A'=T~!(A+BF)T, B' =T7!BG 
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Le i 1 T 0 
(To be sure, it is sufficient to note that P=T™ and Q = FT G .) 


Kronecker’s canonical form of a controllability beam [pI-A -B] thus contains 
only minimal index blocks per columns and, possibly, blocks of finite elementary 
divisors. In order to show the quasi-immediate relation that exists between this 
Kronecker’s form and the more traditional controllability canonical forms (like 
Brunovsky’s form) we will take an example for which the minimal indices per 
columns are equal to {£1} = {1}, {€2} = {2}, and a finite elementary divisor is equal 


to {p+2}: 


P 


[pEx -Hg]= 0 


with a =[p 1], b= i c=[p+2] 


oo 8 
Qo So 
a © © 


Since this form is associated with [pI-A -B], we can write it differently so that it 
maintains a controllability beam form, which will be noted by [pI—A, -B,]. This is 
easily obtained by switching all the constant columns in the last positions. The pair 
(Ao Be) thus obtained is in Brunovsky’s controllable canonical form and, just by 
reading it, we note that the controllable space is of size 3, the pole in {-2} is non- 
controllable and the controllability indices are 1 and 2 (see section 4.3): 


ooo. © 
or © © 


oo = © 


—2 


The general structure of matrices (A,, Be) in Brunovsky’s canonical form is the 
following: 


diag{A:} 0 
0 A 


c 








B. = A 
0 


non C 
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with: 
0 1 0 … 0 0 
0 0 10. 0 0 
Agi E| … …  ! Ba =] [4.33] 
0 0 0 0 1 0 


Oo 
© 
© 
(=) 
© 
= 


Blocks A ,; are of size E;xe;; blocks B,; are of size €, x 1; the remaining matrix 
Anon c (that can be described, for example, in Jordan’s form; see section 4.1.4) is of 


size [nei r-z). It does not exist if the system is controllable: it 
i i 


describes the non-controllable dynamics; integers £g; are the controllability indices of 
the pair (A, B). 


What has just been illustrated for controllability is also applicable and in a dual 
way to observability. 


Let us go back a little to the dynamic systems without a term of control, with # 
and y designating the state space and the observation space respectively. We will 
limit ourselves here to continuous-time systems as described in section 4.9: 


x(t) = A x(t 
o © [4.34] 
y() =C x() 
We can “naturally” associate the observability beam with this system: 
pl-A 
4.35 
ay 4.35) 


i.e. for which E = [I 0]' and H=[A‘ C’]. Due to the injectivity of E, Kronecker’s 
form of the observability beam can have only two types of invariants, i.e. row 
minimal indices and finite elementary divisors (indeed, for the other types of blocks 
see [4.24] and [4.26], the block sub-matrix in E is not of column full rank, and hence 
it cannot be a part of the global injective E). These invariants have a tighter 
connection with more traditional concepts, such as the observability indices and the 
non-observable poles. More exactly, we can easily show that the minimal indices per 
rows of the observability beam are exactly equal to the observability indices of the 
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pair (C, A). The finite elementary divisors of the observability beam correspond 
exactly to the non-observable dynamics (with multiplicities considered for the 
invariant factors) of the pair (C, A). This will be mentioned in section 4.3. Before 
this, we will characterize the group of transformations acting on the dynamic system 
[4.34] and that is equivalent to the group of basis changes on the left and right on 
[pI-A" -C'T. 


“Kronecker’s” transformation group acting on the observability beam 
[pI-A‘ -C']' corresponds identically to the “injection” group acting on the pair 
(C, A), in other words formulated: 





: pl-A pl-A' 
JP & Qreversiblesuchas: P c Q= c 


< JT & H reversible & JR suchas: A' = TA +RO)T, C' =HCT 
TU T'R 
(To be sure, it is sufficient to note that: P = , and Q=T.) 
0 H 


Kronecker’s canonical form of an observability beam [pI- A’ -C']' thus 
contains only blocks of minimal indices per rows and, possibly, blocks of finite 
elementary divisors. In order to show the quasi immediate relation that exists 
between this Kronecker’s form and the more traditional observability canonical 
forms (like Brunovsky’s form) we will take an example for which the minimal 
indices per rows are equal to {nı} = {1}, {n2} = {2} and a finite basic divisor is 
equal to {p+5}: 


0 
[pEx -Hg]= p|, c=[pt5] 
1 


© © 8 


0 0 
b O| with TH b= 
0 


Cc 


orn 


Since this form is associated with [pI-A’ -C"]', we can write it differently so 
that it maintains an observability beam form, which will be noted by [pI-A,' el T 
This is easily obtained by switching all the constant rows in the last positions: 
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0 0 0 0 
0 0 1 0 1 0 0 0 
0 0 0 0 0 1 0 0 
0 0 0 -5 


The pair (A+, Co) thus obtained is in Brunovsky’s observable canonical form. 
The unnoticeable space is of size 1. The pole in {-5} is un-observable and the 
unobservable indices are | and 2 (see section 4.3). 


The general structure of matrices (A,, Co) in Brunovsky’s canonical form is the 
following: 


Jesna? Jopa] n 





7 0 Anon o | 
0 0] 
00 1 0.. 0 
Agree. co tae Se cele Caol 0 0 0 0] [4.36] 
1 
0 


Blocks A,; are of size nı X ni, blocks C., are of size 1 x n; and the remaining 
matrix An (that can be described, for example, in Jordan’s form, see section 4.1.4) is 


of size c -Y m fr = En) . It does not exist if the system is observable; the 
i i 
integers n; are the observability indices of the pair (A, C). 


Let us consider now more general dynamic systems, with u inputs and y outputs. 


4.2.4. Morse’s canonical form 


The systems described by equation [4.12], i.e.: 


x(t) = A x(t) +B u(t) x(k +1) = Ax(k) + B u(k) 
y(t) =C x(t) or y(k) = Cx(k) 
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have as “naturally” associated beam the following matrix, known as Rosenbrock’s 
“system matrix”: 


[4.37] 


I-A -B 
ipe-m=|" | 


-C 0 


For this beam: 


I 0 A B 
E= and H = 
f 1 è ‘| 


“Kronecker’s” group of transformation acting on the system matrix [4.37] 
corresponds identically to the “feedback and injection” group acting on the system 
[4.12], in other words formulated: 


i pI-A -B pl-A' -B' 
JP & Q reversiblesuchas: P Q= 
-C 0 -C' 0 


© JT,G & H reversible & IF & R such as: 


A' =T! (A+BF+ROT, B' =T! BG, C'=HCT 


-1 =] T 0 
(To be sure, it is sufficient to note that: P = E TR andQ = ) 
0 H FT G 


Kronecker’s canonical form of a system matrix contains in general all the 
possible types of blocks. To visualize in terms of matrices A, B and C the form of 
the canonical representation obtained, it is sufficient, like in the previous case, to 
switch the rows and columns in order to move to the right all the constant columns 
(representative of the input matrix) and to the bottom the constant rows 
(representative of the output matrix). Let us take again the example [4.27] of section 
4.2.2, in which there is a block of each type: 
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p 1 0 P l p 
Ep -Hẹļ|= tha=lp-3l.b= = d= 
[p K K] wilh a [p iF à ile p! F À 


ooo 8 
S © & © 
on © © 
X © © © 


The corresponding matrices have then the following form (written here by 
preserving the order of blocks), which is called Morse’s canonical form, and noted 
by (Am, Bm, Cw): 











3000 0 0 0 
00100 0 0 

Ay =|0 0 0 0 0, By =l1 0 
0000 0 0 0 
10 0 0 0 0! 0 11 

ea a 
0000 1, 





The general structure of triplets (Ay, By, Cm) in Morse’s canonical form is the 
following: 


A 0 0 0 0 0 
re 0 A, 0 0 By =|"? 0 “|p 0 C, | 14.38] 
0 0 A3 0 0 0 00 0 C4 

0 0 0 AG 0 By 


where A, is in Jordan’s form (A2, B2) in controllable canonical form [4.33], (A3, C3) 
in observable canonical form [4.36] and (A4, By, C4) in simultaneously controllable 
[4.33] and observable [4.36] form. 


The parts having the indices “2” and “3”, which characterize certain core 
structures (on the right and left), have an important but very particular role in certain 
control or observation problems, called non-regular. We will not discuss in detail 
this aspect here. However, the parts having the indices “1” and “4” that are the result 
of finite and infinite elementary divisors of the system matrix are directly linked to 
invariant finite zero and infinite zero type structures, which we will deal with in 
section 4.3. 
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4.2.5. Notes and references 


The general context of matrix pencils, and particularly Kronecker’s canonical 
form, is detailed in [GAN 66]. The “geometric” presentation done here is mainly 
based on the works of [LOI 86]. A main reference work for the study of various 
beams associated with the analysis of linear systems, such as the system matrix, is 
[ROS 70]; for everything that is more particularly linked to the canonical forms 
presented here as derived from Kronecker’s form, the reader can refer to [BRU 70, 
MOR 73, THO 73]. 


4.3. Invariant structures under transformation groups 


It is exactly because they are invariant under the action of various transformation 
groups that the structures previously introduced have a fundamental role in the 
analysis and synthesis of observation and/or control systems. For example, the poles 
of a given system (in open loop) are invariant by basis changes but they are not so 
by state returns: it is well known in fact that a property equivalent to state 
controllability is the capability to freely modify the poles by state return. However, 
the invariant zeros, finite and infinite, are not at all modifiable by such actions. That 
is why their location conditions the resolving of traditional control problems. In the 
following sections, we will recall a few invariance properties of the main structures 
connected to linear systems. 


4.3.1. Controllability indices 


The controllability indices and the invariant factors of the non-controllable part 
(if it exists) of the pair (A, B) (see section 4.2.3) form a set of full invariants under 
the action of the transformation group (T, F, G) where T and G designate the basis 
changes on the state and on the control, and F is a state return. This “feedback” 
group is defined by: 


(A,B) TES), (T7! (A + BF)T, TBG) 


This basically means that any control law in the form of a regular state return, 1.e. 
u(t) = Fx(¢) + Gv(¢), with G reversible, maintains these structures. Through a 
connection with a more “traditional” definition of controllability indices, noted by 
{cc Cm} where m is the size of the control space, we recall that the general 
characterization of minimal indices per columns as described in [4.28], when 
particularized to the controllability beam [pI-A -B], with B of full rank (injective), 
gives very directly: 
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card {cj} = card tc; >1}=m:= rank(B) 


card {c ; > i} = rank([B AB ...A*"B))~rank((B AB ...A* 7B), for i > 2. 


4.3.2. Observability indices 


The observability indices and the invariant factors of the non-observable part (if 
it exists) of the pair (C, A) (see section 4.2.3) form a set of full invariants under the 
action of the transformation group (T, R, H) where T and H designate basis changes 
on the state and on the output respectively and R is an output injection. This 
“injection” group is defined by: 


(C, A) RH), (T71 (A + ROT, HCT) 


A more “traditional” definition of the observability indices, noted by 
{01,07,...,0;} where / is the size of the output space, can be found in connection to 
the general characterization of minimal indices per rows such as described in [4.29], 
particularized to the observability beam [pI-A' -C']', with C of full rank 
(subjective): 


card {0 ;} = card {0 ; >1} =/:= rank(C) 





C C 
CA CA 
card to; > i} = rank — rank , for i > 2. 


exit] [ais 


4.3.3. Infinite zeros 


As introduced in section 4.2.4, Morse’s canonical form, (Ay, By, Cm) described 
in [4.38], is obtained from the initial system, let us say (A,B,C), by the action of an 
element of the “feedback and injection” transformation group, let us say (Ty, Fm, 
Gy, Rm, Hy). This form is in fact maximally non-controllable and non-observable. It 
is in fact important, based on its particular structure, to verify that the system 
transfer matrix written in Morse’s canonical form will use only the part having the 
index “4” linked to the infinite elementary divisors and has a diagonal form: 
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CylpI-Ayl' By = Calpi-A4f"B, = diag(p "} 


where n; i = 1 to r is the size of each block in part “4” which is in controllable and 
observable canonical form. For example, the following form (A4, B4, C4) where, to 
simplify writing, all the non-specified terms are zero: 














0 1 
1 0 i 
0 0 1 
Ag= > B4 = > C4= 1 0 
0 1 0 0 i eo 
0 0 1 0 
i 0 0 0| i i 


corresponds to the list {n;}= {1,2,3}. The corresponding system has 3 infinite zeros, 
of orders 1, 2 and 3. This is the result of the transfer diagonal structure of Morse’s 
canonical form and because the transformations that lead to the canonical form of 
the system maintain the structure of zeros infinity. 


Indeed, based on the relations: 
Ay =Ty(AtBFy +RyOTy; By =TwBGy; Cy =HyCTy 
the passage from (A,B,C) to (Ay, By, Cm) is reflected in the following relation: 
Cy (pl- Ay) By = B,(p)[C(pI- A) 'B]B(p) 
with: 
B, (p) :=[I-F,,(pI- A) B] IG d 
1 (2) = M? ) ] M an 
-1 -1 
B, (p) =H yO -C(pI- A- BF), ) Ryl 


Transfers B,(p) and B,(p) have the particular property of being biproper 
matrices: a biproper matrix is a proper matrix (i.e. whose bound is finite when p 
tends toward infinite), reversible and when reversed, also proper. A biproper matrix 
is no more than a unimodular matrix (see section 4.1.3), but on the ring of 
eigenfunctions. A scalar biproper is any transfer function in which the numerator 
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and denominator have the same degree. A unimodular (polynomial) has neither pole 
nor finite zero (its Smith’s form is reduced to the identity; see section 4.1.4), a 
biproper on the other hand has only poles and finite zeros and it cannot simplify (by 
product) any singularity to infinity. The behaviors at infinite of (A,B,C) and 
(Am, By, Cm) are thus identical. The behavior of (Ay, By, Cm) is roughly described 
by the list of p”i. The integers n; which are equal in number to the rank of the 
system, are called the orders of infinite zeros of the system considered. 


In a purely “transfer matrix” context, we thus define Smith’s canonical form to 
infinity, which is the canonical representation under the action of the transformation 
group by multiplications, on the left and right, through bipropers. 


The general relations of the [4.30] type also make it possible to geometrically 
characterize the orders of infinite zeros. 


4.3.4. Invariants, transmission finite zeros 


As previously recalled, any multiplication of a given transfer by a unimodular 
preserves the finite singularities of this transfer (a unimodular has only poles and 
infinite zeros). The group of transformations obtained by multiplications on the right 
and left by unimodulars makes it possible to associate with each transfer matrix its 
canonical form, called Smith McMillan’s form, from which the so-called 
transmission poles and zeros can be calculated (linked to the transfer, i.e. to the 
controllable and observable part of the system considered). Synthetically, we can 
obtain it as follows: 


— write the departure transfer, let us say T(p), as T(p) = [1/d(p)] N(p), where d(p) 
is the LMCD (the lowest multiple common denominator) of all the denominators 
present in T(p); 


—write N(p) in Smith’s canonical form (by unimodular actions on the right and 
left); 


— divide each term of the diagonal thus obtained by d(p) and perform all the 
numerators/denominators possible simplifications. 


Hence, we reach a diagonal formula (always with r elements, r being the rank of 
the system), of type €(p) /wi(p), where £1(p) divides €,(p), ..., divides €,(p) and w,(p) 
divides W,.1(p),..., divides w:(p). The transmission poles and zeros of T(p) correspond 
to the roots, respectively, of the denominators y;(p) and the numerators £;(p). 


These transmission structures are related to the “open loop” transfer. They are 
invariant under basis changes but do not remain invariant under the action of 
transformations such as state return or output injection. 
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If we consider a transfer state realization T(p), let us say (A,B,C), the invariant 
zeros defined from the finite elementary divisors of the associated system matrix 
(see section 4.2.4) are invariant under Morse’s group (basis changes, state returns 
and output injections). If the state realization is minimal, the invariant zeros coincide 
with the transmission zeros. Otherwise, the transmission zeros form only a sub- 
group of all invariant zeros. 


4.3.5. Notes and references 


The various structures presented in this section, such as controllability/ 
observability indices and finite/infinite zeros are described in detail in [KAI 80, 
ROS 70] and many other works. 


4.4, An introduction to a structural approach of the control 


The objective of this section is to illustrate, based on relatively traditional control 
problems, the fundamental role played by certain structures (and we will dedicate 
our attention to infinite and finite zeros) in the existence of solutions. We will 
consider in particular the disturbance rejection and the diagonal decoupling. 


Let us consider a stationary linear system in which u(#) represents a control input 
with m components, d(f) a disturbance input with q components and y(t) an output to 
control with / components and described by the state model: 


x(t) = Ax(t) + Bu(¢) + Ed(¢ 
| (t) (t) + Bu(t) + Ed(¢) [4.39] 
y(t) = Cx(0) 
to which the following transfer matrices are also associated: 
-1 -1 
Ta(2)=C(pI-A) B and Ti(p)=C(pl-A) E [4.40] 


The problem of disturbance rejection by state return is formulated as follows: 
finding, if it exists, a state return having the form u(t) = Fx(^ + Ld(® so that, for the 
system thus looped, the transfer matrix between d(p) and y(p) is identically zero. 
When disturbance d(f) is not measured, we impose L = 0. The problem of 
disturbance rejection with internal stability consists of researching, if they exist, F 
solutions so that, in addition, (A + BF) is stable. 
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The problem of diagonal decoupling by regular state return is formulated as 
follows: finding, if it exists, a regular state return having the form u(f) = Fx(A) + Gv(A), 
with reversible square G so that, for the system thus looped, the transfer matrix 
between v(p) and y(p) is diagonal (with principal diagonal), i.e. in the form: 


Tr c (P) = C(pl- A-BF) | BG =[diag{h (p), +, hi(p)}: 0] 


The decoupling problem with internal stability consists of researching, if they 
exist, F solutions so that, in addition, (A + BF) is stable. 


4.4.1. Disturbance rejection and decoupling: existence of solutions 


The action of a state return type control law, as described in the previous section, 
as well as for rejection and for decoupling is translated in terms of transfer matrices 
by the multiplication on the right by a particular biproper matrix. Since such a 
transformation maintains the structure of infinite zeros, it is very natural to see 
conditions of existence of solutions for this type of structure appear. To illustrate 
this, we use the pre-compensator, which is equivalent to the control law selected. 


For disturbance rejection, the transfer between d(p) and y(p) for the compensated 
system by the control law u(f) = Fx(^ + Ld(f) is equal to T,(p)C(p) + Ta(p), that we 
want to cancel, with: 


C(p)=[I- F(pI- A) BY! [F(pI- A) E +L] 


It is easy to realize that C(p) is always proper, even strictly proper (i.e. the bound 
of C(p) is equal to zero when p tends toward infinity) when L = 0, i.e. when the 
disturbance is not available for the control law. 


The equation reflecting the objective of this rejection, i.e. Ty(p)C(p) + Ta(p) = 0, 
can be rewritten as: 


2 I C . 
ET I], Pen o [4.41] 


In this equation, the matrix where C(p) intervenes is biproper (since C(p) is 
proper). A necessary condition for [4.41] to have at least one proper solution is for 
[Ta(p) | Ta(p)] and T,(p) to have exactly the same orders of infinite zeros (because 
this structure is invariant under multiplication by a biproper). It turns out that this 
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condition is also sufficient. We can also show quite simply that this necessary and 
sufficient condition can be reduced to the comparison of two integers. We will 
designate “infinite rollout” the sum of orders of infinite zeros for a given system. 


The disturbance rejection is solvable by u( = Fx( + Ld(d type state return if 
and only if (A,B,C) and (A, [B | E],C) have the same rank and the same infinite 
rollout. 


Variants of this type of result exist when the disturbance is not measured, as well 
as when the state is not measured. In the second case, the existence of control laws is 
dealt with very similar measurement dynamic returns. 


For the decoupling problem, the action of a regular state return 
u( = Fx( + Gv, with reversible square G, is equivalent to the transfer 
multiplication T,(p) by the equivalent biproper pre-compressor: 


C(p):=[1-F(pI- A) |B] 'G 


Let us consider, in order to simplify the explanation, the case of square systems 
(having as many input components as control components) and reversible systems. 
The objective of decoupling is then expressed by the equation: 


Ty (p) C(p) = diag{h, (p), =, hi (p)} [4.42] 


Based on the diagonal form desired, a necessary condition for this equation to 
admit a biproper solution is that, on the one hand, the system is seen in its entirety, 
and on the other hand the reunion of all sub-systems row by row have exactly the 
same orders of infinite zeros (because this structure is invariant under a biproper 
multiplication). It turns out that this condition is equally sufficient. In addition, we 
can also show that this necessary and sufficient condition can be expressed with a 
single integer via the infinite rollout. 


For a system supposed reversible on the right (i.e. whose transfer is of full rank 
per rows), decoupling is solvable by u(¢) = Fx(4) + Gv(f) type regular state return if 
and only if the infinite rollout of (A, B, C) is equal to the sum of infinite rollouts 
calculated for each row sub-system (A, B, ci), where c; designates the i" row of C. 
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4.4.2. Disturbance rejection and decoupling: existence of stable solutions 


When the (natural) constraint of internal stability is added, the unstable zeros (if 
they exist) will have a similar role to the one of the infinite zeros with respect to the 
existence of solutions. The simplest way to be sure of this is to be able to formulate 
the control problem from the “transfer” equation. Before that, we must of course 
assume that the system considered can be stabilized. We can, however, assume it is 
already stable in open loop (if not a first stabilizing loop is to be used). The internal 
stability of the compensated system is hence translated simply by the necessary 
stability of the compensator researched. We must then solve a [4.41] or [4.42] type 
equation, on the ring of eigen and stable functions, and not only of eigenfunctions. 
The infinite zeros or the zeros with unstable values will then intervene as 
fundamental ingredients for the existence of solutions. For a system given under its 
state description, we will designate by “infinite and unstable rollout” the integer 
obtained by calculating the sum of the infinite rollout with the total number of 
unstable invariant zeros (sum of orders of multiplicity, irrespective of the 
corresponding particular (unstable) location). Thus, we obtain fairly simply the 
following results: 


-the disturbance rejection is solvable with internal stability by 
u(4) = Fx(4) + Ld(? state return if and only if (A, B, C) and (A, [B | E],C) have the 
same rank and the same infinite and unstable rollout; 


— for a system assumed to be reversible on the right, decoupling is solvable with 
internal stability by u(f) = Fx( + Gv( type regular state return if and only if the 
unstable and infinite rollout of (A,B,C) is equal to the sum of infinite and unstable 
rollouts calculated for each row sub-system (A, B, c:), where c; designates the i" row 
of C. 


4.4.3. Disturbance rejection and decoupling: flexibility in the location of poles/ 
fixed poles 


The results presented in the two previous sections are basically multi-variable in 
nature. They are obviously all the more valid in particular cases like, for example, in 
mono-variable cases. 


In this broad context of multi-variable systems, when the control problem 
considered is solvable, “the” solution is generally non-unique. Apart from 
researching, among all possible solutions, at least one stabilizing solution, we are 
often tempted to take advantage of the remaining degrees of freedom in order to 
fulfill supplementary objectives and especially to target certain poles (not only stable 
but, for example, sufficiently damped) for the solution looped system. The question 
of the possible flexibility in terms of the poles’ position then arises. 
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For the various control problems mentioned in this chapter (i.e. model pursuit, 
disturbance rejection or decoupling, etc.), it turns out that the simple fact of wanting 
to solve this “exact” problem leads to the inevitable appearance of an entire set of 
poles which are present in any solution. These poles are the “fixed poles” of the 
problem considered. Knowing them makes it possible to delimit the constraints 
imposed by the problem in terms of modifications of dynamics. Reaching a few 
“controllability” type (minimal) hypotheses, we can then find solutions that make it 
possible to position all the other poles, except for, obviously, these fixed poles, 
which, again, find their origin in the non-coincidence of certain structures of 
invariant finite zeros. We can in fact set the following results, which make it 
possible to look at the previous section as a particular case. We assume, for all the 
mentioned, cases that the control problem mentioned is solvable, in the sense of 
section 4.4.1: 


—the fixed poles of disturbance rejection by state return coincide with the 
invariant zeros of (A, B, C) which are not invariant zeros of (A, [B | E],C). When 
the extended pair (A, [B | E]) is “globally” controllable (a totally natural 
hypothesis), all the other poles (other than these fixed poles) can be positioned by a 
proper choice of state return; 


—the fixed poles of the decoupling by regular state return coincide with the 
invariant zeros of (A,B,C) which are not in the group obtained by bringing together 
the invariant zeros of each row sub-system (A, B, ci), where c; designates the i™ row 
of C. When the pair (A,B) is controllable, all the other poles (other than these fixed 
poles) can be positioned by a proper choice of the state return; 


— the existence of stable solutions is simply equivalent to the juxtaposition of two 
conditions: existence of solutions (section 4.4.1) and stability of all (possible) fixed 
poles. 


4.4.4. Notes and references 


The disturbance rejection and the diagonal decoupling were the object of 
numerous contributions, i.e. [BAS 92, TRE 01, WOH 85] for geometric treatments. 
Additional information on the treatment of this kind of control problem based on 
rational equations and especially the use of various rings in order to find solutions, 
can be found in [VID 85]. The results pertaining to the existence conditions 
expressed in terms of infinite structures were the object of several theses, such as 
[DIO 83, MAL 85]. Among the most recent contributions that complete the 
presentation of the existence of stable solutions, the use of rollouts and in particular 
the fixed poles and the remaining degrees of freedom, we can mention [MAL 93, 
MAL 97] and [MAR 94, MAR 99]. 
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4.5. Conclusion 


This chapter is to be considered as an introduction to a structural approach of 
the control. Its main objective was to introduce, by a simultaneous use of geometric 
and algebraic approaches, an entire set of structures closely related to the system 
considered and to illustrate the fundamental role they play in solving the control 
problems. 


The presentation was limited to the linear, stationary and finite size case. 
Extensions of certain results are available for the more general classes of systems, 
for example, non-linear [MOO 87] or with delays [RAB 99]. The object of this final 
section is to mention another extension in the field of optimization. 


4.5.1. Optimal attenuation of disturbance 


When the “exact” disturbance rejection, as formulated at the beginning of section 
4.4, is not solvable with stability due to the presence of at least one unstable fixed 
pole, the designer has the alternative to tone down the objective of the control. 
Instead of targeting an exact rejection, we can limit ourselves to an attenuation 
(optimal if possible) of this disturbance, as per a certain standard. 


This point of view was largely developed by several authors such as Saberi, 
Sannuti and Stoorvogel (see, for example, [SAB 96]). Due to a reformulation of the 
optimization problem into an exact problem where the matrices of the corresponding 
state model are slightly modified [STO 92], the solutions of the H»-optimal 
attenuation problems can be obtained from the analysis of exact rejection problems. 
The same applies for the H2-optimal fixed poles, which are present in any optimal 
solution (see [CAM 00]). 
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Chapter 5 


Signals: Deterministic and Statistical Models 


5.1. Introduction 


This chapter is dedicated to signal modeling procedures and in particular to sta- 
tionary random signals. After having discussed the spectral characterization of deter- 
ministic signals, with the help of the Fourier transform and energy spectral density, we 
will now define the power spectral density of stationary random signals. We will show 
that a simple modeling by linear shaper filter excited by a white noise makes it possi- 
ble to approach a spectral density with the help of a reduced number of parameters and 
we will present a few standard structures of shaper filters. Next, we will extend this 
modeling to the case of linear processes with deterministic input, in which the noises 
and disturbances can be considered as additional stationary noises. Further on, we will 
present the representation in the state space of such a modeling and the relation with 
the Markovian processes. 


5.2. Signals and spectral analysis 


A continuous-time deterministic signal y(t),t € R is, by definition, a function of 


Rin C: 
y: R — C 
tr y(t) 


where variable t designates time. In short, we speak of a continuous signal even if the 
signal considered is not continuous in the usual mathematical sense. 
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A discrete-time deterministic signal y[k],k € Z is, by definition, a sequence of 
complex numbers: 


y= (ulk]) pez 


In short, we often speak of a discrete signal. In general, the signals considered, 
be they continuous-time or discrete-time, have real values, but the generalization to 
complex signals done here does not entail any theoretical problem. 


The spectral analysis of deterministic signals consists of decomposing them into 
simpler signals (for example, sine curves), in the same way as a point in space is 
located by its three coordinates. The most famous technique is the Fourier transform, 
from the French mathematician J.B. Fourier (1768-1830), which consists of using 
cisoid functions as basic vectors. 


The Fourier transform y( f) of a continuous-time signal y(t) is a function of the 


form Y : fro y( f) of a real variable with complex number value, which is defined 
for any f by: 


= +0 
u (f)= / y(t) e929" FF dt [5.1] 


— OO 


We note from now on that if variable t is homogenous to a certain time, then vari- 
able f is homogenous to a certain frequency. We will admit that the Fourier transform 
is defined (i.e. the integral above converges) if the signal has finite energy. The Fourier 


transform does not entail any loss of information. Indeed, knowing y( f), y(t) can be 
rebuilt by the following reverse formula; for any t: 


+oo | 
y(t) = J Deora [5.2] 


— CO 


The Fourier transform is in fact the restriction of the two-sided Laplace transform 


y(s) to the axis of complex operators: YF) = ÿ(327 f) with, for any s € C: 


+00 
ÿ(s) = J y(t) e ** dt [5.3] 


— OO 


Likewise, the Fourier transform (or normalized frequency transform) y(v) of a 
discrete-time signal y[k] is a function of the form: 


Y:R—3C 


v — y(v) 
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defined for any v by: 


y(v) = Y ylkje 927 [5.4] 


k=—o0o 
We will accept that the Fourier transform of a discrete-time signal is defined (i.e. 
the above sequence converges) if the signal has finite energy. It is periodic of period 


1. It is in fact the restriction of two-sided z transform ÿ(z) to unit circle: y(v) = 
y(e)2"”) with, for any z € C: 


wz) = X yik] [5.5] 
k=—00 
The Fourier transform does not entail any loss of information. Indeed, knowing 
y (v), we can rebuild y[k] by the following reverse formula; for any k: 


1 
FIA 


ylk] = J y (v) e?" dv [5.6] 


3 

The Fourier transform (continuous-time or discrete-time) verifies the following 
fundamental problem: it transforms the convolution integral into a simple product. Let 
yi(t) and y2(t) be two real variable functions; the convolution integral (y1 © y2)(t) is 
defined for any t by: 


+oo 


(yı © y2)(t) = / yi(T) yo(t — 7) dr [5.7] 


— CO 


Likewise, let y1[k] and y2[k] be two sequences; their convolution integral (y1 ® 
y2)[k] is defined for any k by: 


+00 
(yi. ®w)k]= X ulm] yolk — m] [5.8] 


The convolution integral verifies the commutative and associative properties, and 
the neutral element is: 

— 0(t) Dirac impulse for functions (6(t) = 0 if we have t Æ 0, LS ô(t) dt = 1); 

— 0[k] Kronecker sequence for sequences (6[0] = 1, 6[k] = 0 if we have k Z 0). 

In addition, the convolution of a function or sequence with delayed neutral element 

delays it with the same quantity. It is easily verified that the Fourier transform of the 


convolution integral is the product of transforms: 


(yı 8 yo)” = Yı Yə [5.9] 
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On the other hand, the Fourier transform preserves the energy (Parseval theorem). 
Indeed, the energy of a continuous-time signal y(t) or of a discrete-time signal y[k] 


can be calculated by the square integration of the Fourier transform module y( f) or 


its normalized frequency transform Y (v): 


— continuous-time signals: i ly(t)|? dt = fies ly (f)I2 df; 


ic 
— discrete-time signals: 5%. |a[k]? = [7? |z(v)|2 av. 
2 


The function or sequence | Y |? is called a power spectrum, or energy spectral 
density of signal y because its integral (or its summation) returns the energy of signal 


y. 
The Fourier transform is defined only for finite energy signals and can be extended 
to periodic or impulse signals (with the help of the mathematical theory of distribu- 


tions). We will give a few examples below. 


EXAMPLE 5.1 (DIRAC IMPULSE). The transform of Dirac impulse is the unit function: 


6(f) = 1r(f) [5.10] 


EXAMPLE 5.2 (UNIT CONSTANT). It is not of finite energy, but admits a Fourier trans- 
form in the sense of distribution theory, which is a Dirac impulse: 


Ir(f) = 6(f) [5.11] 

EXAMPLE 5.3 (CONTINUOUS-TIME CISOID). We have the following transformation: 
y(t) =e  y(f) = 5(F — fo) [5.12] 

Therefore, this means that the Fourier transform of the frequency cisoid fo is an 
impulse centered in fo. By using the linearity of the Fourier transform, we easily 


obtain the Fourier transform of a real sine curve, irrespective of its initial phase; in 
particular: 


y(t) =cos(2x fot) P) = ZOS — fo) + OÙ + Fo)] [5.13] 





y(t) = sin(27 fot) V(F) = FU — fo) -8U + fo)] [5.14] 
EXAMPLE 5.4 (KRONECKER SEQUENCE). We immediately obtain: 


6(v) = In(v) [5.15] 
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EXAMPLE 5.5 (UNIT SEQUENCE). The Fourier transform of the constant sequence 
1Z[{k] is the impulse frequency comb ©: 


124%) =) = $ ôw- k) [5.16] 


k=—0c 
EXAMPLE 5.6 (DISCRETE-TIME CISOID). We have the following transform: 
yik] =e?" k Y(f) = Ex (vp) [5.17] 


Thus, this means that the Fourier transform of the frequency cisoid 1 is a fre- 
quency comb centered in 1. 


Very often, the spectral analysis of deterministic signals is reduced to visualizing 
the energy spectral density, but numerous physical phenomena come along with dis- 
turbing phenomena, called “noises”; for example, mechanical systems generate vibra- 
tory or acoustic signals which are not periodic and have infinite energy. 


The mathematical characterization of such signals is particularly well formalized 
in the case of stationary and ergodic random signals: 


— random: this means that, in the same experimental conditions, two different 
experiences generate two different signals. The mathematical treatment can thus be 
only probabilistic, the signal observed being considered as the realization of a random 
variable; 


— stationary: the statistical characteristics are then independent of the time origin; 


— ergodic: any statistical information is included in a unique realization of infinite 
duration. 


In any case, the complete characterization of such signals is expressed with the 
help of the combined probability law of the values taken by the signal in different 
instants, irrespective of these instants and their number. For example, for a Gaussian 
random signal, this combined law is Gauss’ probability law. For a white random signal 
(or independent), this combined density is equal to the product of marginals (to clarify 
a current confusion, we note that these two notions are not equivalent: a Gaussian 
signal can be white or not, a white signal can be Gaussian or not). In practice, we have 
the second order statistical analysis that deals only with the first and second order 
moments, i.e. the mean and the autocorrelation function. 


A discrete-time random signal y[k], k € Z is called stationary in the broad sense 
if its mean my and its autocorrelation function r,,,[«] defined by: 


my = E(y[k] 


Pyle] = ECG — my)" (yik +s] —my)) WeZ [5.18] 
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are independent of index k, i.e. independent of the time origin. o? = Tyy[0] is the 


oe Il ; 


variance of the signal considered. is the correlation coefficient between the sig- 


nal at instant k and the signal at ont k + x. It is traditional to remain limited only 
to the mean and the autocorrelation function in order to characterize a stationary ran- 
dom signal and this even if the characterization, referred to as of second order, is very 
incomplete (it is sufficient only for the Gaussian signals). 


In practice, there is only one realization y[k], k € Z of a random signal y[k] for 
which we can define its time mean (y[k]}: 


1 N 


Wk) = Jim zyr D VA [5.19] 
k=-N 


The random signal y[k] is called ergodic for the mean if mean my is equal to the 
time mean of any realization y[k] of this random signal: 


E(y[k]) = (y[k]) ergodicity for the mean [5.20] 


In what follows, we will suppose that the random signal y{k] is ergodic for the 
mean and, to simplify, of zero mean. 


The random signal y[k] is called ergodic for the autocorrelation if the autocorre- 
lation function r,,[K] is equal to the time mean (y*[k] y[k + «]) calculated from any 
realization y[k] of this random signal: 


E ("TRI ylk + #]) = (y TA] uk +s) Wee Z [5.21] 


ergodicity for the autocorrelation 


this time mean being defined for any « by: 


N 


QT y[k + K]) = lim = ~ a ay ylk + 4] [5.22] 


The simplest example of ergodic stationary random signal for the autocorrelation 
is the cisoid ae? C7 ”"+%), k € Z of initial phase @ evenly distributed between 0 
and 27, of autocorrelation function a? el?” 70, k € Z. However, the ergodicity is lost 
if the amplitude is also random. In practice, the ergodicity can be rigorously verified 
only rarely. In general, it is a hypothesis — necessary in order to obtain the second order 
statistical characteristics of a random signal considered from a single realization. 
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Under the ergodic hypothesis, the variance a of the signal considered is equal to 
the power (|y[k]|?) of any realization y: 


N 


2 _ 2\ — ys 1 2 
oy = (y) = Jim xs 2 Wi [5.23] 


i.e. the energy of the signal y multiplied by the truncation window 1_x x which is 
equal to 1 on the interval {—N,...,N} and zero otherwise, divided by the length of 
this interval when N — +00. With the help of Parseval’s theorem, we obtain: 


5 : 1 +3 nagy 
o= jim gy |, ONA 


—i 
2 


+3 1 
= : +t a 9 
=f { im, zN g1 1-NN) (v)| bay [5.24] 


2 


Hence, through formula [5.24], we have decomposed the power of the signal on the 
frequency axis, with the help of function v —> limy— oo INH (yl_n.w)~(v)|?. In 
numerous works, we define the power spectral density (or power spectrum, or spec- 
trum) of a stationary random signal by this function. However, in spite of the ergodic 
hypothesis, we can show that this function depends on the realization considered. We 
will define here the power spectral density (or power spectrum) Syy as the mean of 
this function: 











: 1 = 
Syy(v) = lim E (sw Ty 1.) wp) [5.25] 
N 2 
= —3 2r vk 
Jim E (z A 2 ie ) [5.26] 








Hence, we have two characterizations of a stationary random signal in the broad 
sense, ergodic for the autocorrelation. Wiener-Khintchine’s theorem makes it possible 
to show the equivalence of these two characterizations. Under the hypothesis that the 
sequence (kryy[K]) is entirely integrable, let: 


+00 


XO kryylk]l < 00 [5.27] 


K=— OO 


then, the power spectral density is the Fourier transform of the autocorrelation function 
and the two characterizations defined above coincide: 


Syy(v) = T yy(v) [5.28] 
+00 
= X. onde [5.29] 


K=— OO 
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Indeed, by developing expression [5.26], we obtain: 


: 1 —3 2r v(n—k 
Syy(v) = im 5 e( 5. an jiro ») 
N k=—N 














: 1 —3 2r v(n—k) 
= ONF o Tyyln — k] e” 
2N 
: |] —J2TLK 


x card {(n, k) | k =n — k and |n| < N and |k| < N} 
ka MMM 


2N+1-|K| 
2N Ikl 
sdim 2 (1- D)" ai 
—2N 
2N 





= r yy) — dim IN +1 5 |s| Tyy [A] eure 
K=—2N 
Under hypothesis [5.27], the second term above disappears and we obtain formula 


[5.29]. 


These considerations can be reiterated briefly for continuous-time signals. A conti- 
nuous-time random signal y(t),t € R is called stationary in the broad sense if its 
mean m and its autocorrelation function r,,(7) defined by: 


my = E(y(#) 
tes = B((y(t)—my)*(y(t+7)—m,))  VreR 901 


are independent of time t. 


For a realization y(t),t € R of a random signal y(t), the time mean (y(t)) is 
defined by: 


1 EH 
WO) = jim zy | Oa [5.31] 


The ergodicity for the mean is written: 


E (y(t)) = (y(t) [5.32] 


In what follows, we will suppose that the random signal y(t) is ergodic for the 
mean and, to simplify, of zero mean. 
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The random signal y(t) is ergodic for the autocorrelation if: 
E(y@®yt+)=wOyttr)) VvWeRr [5.33] 


this time mean being defined for any 7 by: 


1 T 
(w'(ult-+7)) = im oo fa ult +r) a [5.34] 


The power spectral density Syy is expressed by: 


Su) = Jim E (zlo ran) [5.35] 


LI Hal 
7 S, —327 ft 
jim E E dt [5.36] 


If function (T ryy(T)) is entirely integrable, let: 
+00 
J |T Tryy(T)| dT < œ [5.37] 


— CO 





then the power spectral density is the Fourier transform of the autocorrelation function: 


Su = roi [5.38] 
+00 
= J Tyy(T)e "ST dr [5.39] 


Power spectral density is thus a method to characterize the spectral content of a 
stationary random signal. For a white signal, the autocorrelation function is expressed, 
with q > 0, by: 


Tyy = QÔ [5.40] 


Through the Fourier transform, we realize immediately that such a signal has a 
power spectral density constant and equal to q. 


Under the ergodic hypothesis, for the discrete-time signals, the power spectral den- 
sity can be easily estimated with the help of the periodogram; given a recording of N 


points y[0], . . . , y| — 1] and based on expression [5.26], the periodogram is written: 
1 TS 
Tyy(v) = wily 1o,w-1) (v)P [5.41] 
Le 2 


[5.42] 


=| 





1 
y|k] e 27 vk 
0 





k= 
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where lo n-1 is the rectangular window equal to 1 on the interval {0,...,N — 1} 
and zero otherwise. With regard to the initial definition of power spectral density, we 
lost the mathematical expectation operator as well as the limit passage. This estimator 
is not consistent and several variants were proposed: Bartlett’s periodograms, modi- 
fied periodograms, Welsh’s periodograms, correlogram, etc. The major drawback of 
the periodogram, and more so of its variants, is the bad resolution, i.e. the capability 
to separate the spectral components coming from close frequency sine curves. More 
recently, methods based on a signal modeling were proposed, which enable better res- 
olution performances than those of the periodogram. 


5.3. Generator processes and ARMA modeling 


Let us take a stable linear process with an impulse response h, which is excited by 
a stationary random signal e, with output y: 


y=h@e [5.43] 


Hence, we directly obtain that signal y is stationary and its autocorrelation function 
is expressed by: 


Tyy =h@h* 8 Tee [5.44] 


where h*~ represents the conjugated and returned impulse response (h*~(t) = 
(h(—t))*). Through the Fourier transform, the power spectral density of y is expressed 
by: 


Syy = |h | See [5.45] 


In particular, if e is a white noise of spectrum q, then: 


Syy =4|h|” [5.46] 


Inversely, given a stationary random signal y with a power spectral density Syy, 
if there is an impulse response h and a positive real number q so that we can write 
formula [5.46], we say that this system is a generating process (or a shaper filter) for 
y. Everything takes place as if we could consider signal y as the output of a linear 
process with an impulse response h excited by a white noise of spectrum q. 


This modeling depends, however, on any impulse response h of the shaper filter. 
In order to be able to obtain a modeling with the help of a finite number of parameters, 
we know only one solution to date: the system of impulse response h has a rational 
transfer function. Consequently, we are limited to the signals whose power spectral 
density is a rational fraction in j 27 f for continuous-time and e??"” for discrete- 
time. Nevertheless, the theory of rational approximation indicates that we can always 
get as close as we wish to a function through a rational function of sufficient degrees. 
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Since the module of the transfer function of an all-pass filter is constant, such 
a filter does not enable under any circumstance to model a certain form of power 
spectral density. Hence, we will suppose that the impulse response filter h is causal 
with minimum of phase, i.e. its poles and zeros are strictly negative real parts for 
continuous-time and of a module strictly inferior to 1 for discrete-time. 


Finally, we note that formula [5.46] is redundant, i.e. the amplitude of power spec- 
trum Syy can be set either by the value of spectrum q or by the value of the filter 
gain for a given frequency. Hence, it is preferable to set the impulse response h, or its 
Fourier transform, in a certain sense. 


For discrete-time, it is usual to choose a direct transmission shaper filter (h[0] 4 0) 
and the impulse response is normalized with A[0] = 1 (in this case we say that the 
filter is monic). The equivalent for continuous-time consists of considering an impulse 
response with a Dirac impulse of unitary weight at instant 0. If this condition does not 
entail any constraint in the case of discrete-time (a pure delay being in this case an 
all-pass filter), in the case of continuous-time it implies that the power spectral density 
of the signal is not cancelled in high frequency. 


For discrete-time, the transfer function of the filter is thus written: 











: 1 + S cn] go” 
A(z) = oe = —! [5.47] 
1 + X aln] gon 


The orders na and ne characterize the structure chosen. The parameter vector 
0 = [q,al1},...,alnal], c[1], ...,c[nc]] is then necessary and sufficient to correctly 
characterize the shaper filter. 


In the case of a finite impulse response filter (na = 0), we talk of an MA (moving 
average) model because signal y[k] is expressed with the help of a weighted average 
of the input e[k] on a sliding window: 


ylk] = e[k] + cf] elk — 1] +: + cn]elk — ne] [5.48] 


The MA model is particularly capable of representing the power spectrums pre- 
senting strong attenuations in the proximity of given frequencies (see Figure 5.1). 
Indeed, if ¢(z) admits a zero of a module close to 1 and of argument 27 vo, then the 
power spectrum is almost zero in the proximity of 1%. 
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Figure 5.1. Typical power spectrum of an MA (left) 
or AR (right) model 


In the case of a single denominator (ne = 0), we talk of an AR (autoregressive) 
model, because signal y[k] at instant k is expressed with the help of a regression on 
the signal values at the previous instants: 


yl] = —a[l] y[k — 1] = -+ + — afna] ylk — na] + eff] [5.49] 


The AR model is particularly supported for two reasons. On the one hand, its esti- 
mation by maximum likelihood, with the help of a finite period recording of signal 
y, reaches an explicit solution (in the general case, we would have to call upon an 
optimization procedure). On the other hand, it is particularly capable of representing 
power spectrums presenting marked peaks in the proximity of certain frequencies, i.e. 
signals presenting marked periodicities (see Figure 5.1); for example, Prony’s ances- 
tral method, pertaining to the estimation of the frequency of noisy sine curves, deals 
with determining the argument of the poles of an AR model identified by maximum 
likelihood. 


In the general case, we speak of an ARMA (autoregressive with moving average) 
model: 


ylk] = —all] ylk — 1] — -+ + — afna] y[k — na] 
+ e[k] + c[1] elk — 1] + <- + cn] elk — ne] [5.50] 


Finally, we note that the choice of normalization h[0] = 1 is not innocent. Indeed, 
the predictor filter of one count providing ÿ[k], prediction of y[k] on the basis of 





previous observations y[k — 1], y[k — 2],..., obtained from the shaper filter [5.47] as: 
` a(z 

atk] = (1 - 35} Ju [5.51] 
č(z) 


is the optimal predictor filter, in the sense of the prediction error variance y[k] — ÿ[k], 
among all linear filters without direct transmission. This prediction error is then rigor- 
ously the white sequence e[k]. 
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5.4. Modeling of LTI systems and ARMAX modeling 


Let us take a linear time-invariant (LTI) system, of impulse response g. The 
response of this system at the known deterministic input u is g ® u, which can thus be 
calculated exactly. However, this is often unrealistic, because there are always signals 
that affect the operating mode of the system (measurement noises, non-controllable 
inputs). In a linear context, we will suppose here that these parasite phenomena 
are translated by an additional term v on the system output. The output y is then 
expressed by: 


y=gSutu [5.52] 


Hence, it is natural to propose a probabilistic context for this disturbance v and to 
consider it as a stationary random signal, admitting a representation by shaper filter; 
the output measured y is then expressed by: 


y=gSuth@e [5.53] 


where u is the known deterministic input, e an unknown white noise of spectrum q, g 
the impulse response of the system and h the impulse response of the shaper filter. We 
suppose that h and g are the impulse responses of the systems with rational transfer 
function, and, to simplify, that g does not have direct transmission. 


5.4.1. ARX modeling 


For discrete-time, the simplest relation input-output is the following difference 
equation: 


ylk] = —aft] y[k — 1] — +: — anal ylk — nal 
+ b[1] ufk — 1] +--+ + bine] ulk — ne] + e[k] [5.54] 


where the term of white noise e[k] enters directly in the difference equation. This 
model is hence called “equation error model”. Thus, the transfer functions become: 


X ofn” 





ÿ(z) = 2) = [5.55a] 
a(z) ve 
1+ Xale” 
n=1 
h(z) = = - [5.55b] 





1+ Sajen 
n=1 


We also talk of ARX modeling, “AR” referring to the modeling of the additional 
noise and “X” to the exogenous input u[k]. Given the orders na and np, the parame- 
ter vector 0 = |q, a[1],...,afn,],b[1],...,b[n;]] fully characterizes the system. This 
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model is not especially realistic but, as in the case of AR, we can show that the iden- 
tification by maximum likelihood of an ARX model leads to an explicit solution. 


5.4.2. ARMAX modeling 


The ARX model does not give much freedom on the statistical properties of the 
additional noise. A solution consists of describing the equation error with the help of 
a running average: 








ylk] = —al1] ylk — 1] —--- — afna] ylk — na] 
+ dil] ulk — 1] +--+ + b[np] u[k — ny] 
+ elk] + c[1] e[k — 1] +--+ + c[ne] e[k — ne] [5.56] 
Thus, the transfer functions become: 
: b[njz—” 
o lo. Š 
glz) = sr = [5.57a] 
a(z) ve 
1+ Soa n|z 





— [5.57b] 
1 + Soa niz" 


We talk of ARMAX modeling, “ARMA” pertaining to the modeling of the addi- 
tional noise. Given the orders na, ny and ne, the parameter vector 0 = {q,a[l1],..., 
afna], b[1],..., b[ne], cf1],..., c[ne]] fully characterizes the system. 














5.4.3. Output error model 


In the particular case of the ARMAX model where we take ¢(z) = à(z), the 
transfer functions become: 


X ofna” 


(z) = n=1 
1+ X alne” 


Hence, only an additional white noise remains on the process output. We talk 
of an output error (OE) model. Given the orders na and ny, the parameter vector 
0 = [q a[l], ...,a[na], b[1],..., b[na]] fully characterizes the system. We can show 





h(z)=1 [5.58] 
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that even if this hypothesis is false (i.e. if the additional noise if colored), the identifi- 
cation of 0 by maximum likelihood leads to an asymptotically non-biased estimation 
(but this estimation is not of minimal variance in this case). 


5.4.4. Representation of the ARMAX model within the state space 


We present here the reverse canonical form, in which the coefficients of transfer 
functions appear explicitly, which is written by assuming that d = max na, Np, Ne the 
size of the state vector x and by possibly completing sequences a, b or c by zeros: 


z[k + 1] = A z[k] + Bulk] + K e[k] 
ee = C z[k] + e[k] [5.33] 
—aji] 1 0 0 
; ROR 
A= f po 6 A, [5.60a] 
: : 0 
—ald—1] 0 0 1 
—ald] 0 0 
b[1] c[1] — afl] 
ae | K- | [5.60b] 
bja eld) — ald 
C=[1 0 «+ e+ e+ 0] [5.60c] 


5.4.5. Predictor filter associated with the ARMAX model 


The one count predictor filter providing g[k], prediction of y[k] on the basis of 
the previous observations y[k — 1], ylk — 2], etc., and of input u[k], u[k — 1], ete., is 
obtained as: 








ilk] = T 4 (1 = — ) ate [5.61] 


It is the optimal predictor filter, in the sense of the second momentum of the pre- 
diction error y[k] — ÿ[k], among all linear filters without direct transmission on y[k]. 
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This prediction error is then rigorously the white sequence e[k]. This is the basis of 
the identification methods for ARMAX models through the prediction error method. 
Given an input-output recording of N points (u[k],y[k])o<k<n-—1, we will choose 
among all the predictor filters of the form [5.61], which is parameterized by 0 and 
providing a prediction yg[k], the one that minimizes the square mean of the prediction 


error: 
N-1 


a . i F: 2 
Ô = arg min = 2 lulk] — ĝo lk] [5.62] 


This estimator is in fact the estimator of maximum likelihood in the case of a 
Gaussian white noise hypothesis. We note that the hypothesis of a shaper filter with 
minimum phase leads to a stable causal predictor. 


5.5. From the Markovian system to the ARMAX model 


The representation within the state space [5.59], in which the unique noise 
sequence e[k] intervenes both on the equation of state and on the equation of meas- 
urement, is called “innovation form” or “filter form”. However, by generalizing to the 
study of systems with m inputs (u[k] is a vector with m lines) and p outputs (y[k] is a 
vector with p lines), the random contributions are usually represented with the help 
of two noises v[k] (the noise of the system) and w[k] (the measurement noise) in a 
representation within the state space of size d as follows: 
alk +1] = A z[k] + Bulk] + v[k] [5.63] 
ylk] = C afk] + wih] | 


where v[k] and w[k] are two white noises of spectra Q and R respectively and of 
interspectrum S, i.e.: 


E (v*[k] vT [k + «]) = Q ôfk] 
E (w* [k] wT [k + x]) = R ô[k] [5.64] 
E (v*[k] wT [k + «]) = 5 ôlk] 


Noise v[k] generally represents the uncertainties on the process model or the dis- 
turbances on the exogenous input. Noise w[k] generally represents the measurement 
noise. We talk of a Markovian system. 


However, Kalman’s filtering (see Chapter 7) enables us to show that it is always 
possible to represent such a system in the innovation form, as: 


[5.65] 


n 1] = A ĉ[k] + Bulk] + K elk] 
ylk] = C ĉ[k] + elk] 
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where ĉ[k], e[k] and K are the state prediction, the innovation (and we can prove it 
is white) and the gain of Kalman’s stationary filter operating on model [5.63]. Such a 
form is minimal, in the sense that it entails only as many noises as measurements. 


In the particular mono-input-mono-output case, we find the ARMAX model, 
whose canonical form is given in [5.60]. 
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Chapter 6 


Kalman’s Formalism for State 
Stabilization and Estimation 


We will show how, based on a state representation of a continuous-time or 
discrete-time linear system, it is possible to elaborate a negative feedback loop, by 
assuming initially that all state variables are measurable. Then we will explain how, 
if it is not the case, it is possible to build the state with the help of an observer. 
These two operations bring about similar developments, which use either a pole 
placement or an optimization technique. These two approaches are presented 
successively. 


6.1. The academic problem of stabilization through state feedback 


Let us consider a time-invariant linear system described by the following 
continuous-time equations of state: 


x(t) = Ax(t) + Bult) ; x(0) #0 [6.1] 


where xe R” is the state vector and u e R” the control vector. The problem is 
how to determine a control that brings x(t) back to 0, irrespective of the initial 
condition x(0). In this chapter, our interest is mainly in the state feedback controls, 
which depend on the state vector x. A linear state feedback is written as follows: 


Chapter written by Gilles DUC. 
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u(t) = -K x(t) + e(t) [6.2] 


where K is an mxn matrix (Figure 6.1) and signal e(t) represents the input of the 
looped system.. 


The equations of the looped system are written as follows: 


x(t)=(A-BK) x(t) + Be(t) [6.3] 


Kj" 


Figure 6.1. State feedback linear control 





Hence, the state feedback control affects the dynamics of the system which 
depends on the eigenvalues of A— BK (let us recall that the poles of the open loop 
system are eigenvalues of A; similarly, the poles of the closed loop system are 
eigenvalues of A- BK). 


In the case of a discrete-time system described by the equations: 
Xpay =F xp + Gu, 3 x9 #0 [6.4] 
the state feedback and the equations of the looped system can be written: 
up =—-K xp + ey [6.5] 
Xe =(F-GK) x, + Geg 3 x9 #0 [6.6] 


so that the dynamics of the system depends on the eigenvalues of F- GK. 


The research for matrix K can be done in various ways. In the following section, 
we will show that under certain conditions, it makes it possible to choose the poles 
of the looped system. In section 6.4, we will present the quadratic optimization 
approach, which consist of minimizing a criterion based on state and control vectors. 
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6.2. Stabilization by pole placement 
6.2.1. Results 


The principle of stabilization by pole placement consists of a priori choosing the 
poles preferred for the looped system, i.e. the eigenvalues of A-BK in 
continuous-time (or of F-GK in discrete-time) and then to obtain matrix K 
ensuring this choice. The following theorem, belonging to Wonham, specifies on 
which condition this approach is possible. 


THEOREM 6.1— a real matrix K exists irrespective of the set of eigenvalues 
{An}, real or conjugated complex numbers chosen for A-BK (for 
F-GK respectively) if and only if (A, B) ((F, G) respectively) is controllable. 


Demonstration. It is provided for continuous-time but it is similar for discrete-time 
as well. Firstly, let us show that the condition is sufficient: if the system is not 


controllable, it is possible, through passage to the controllable canonical form (see 
Chapter 2), to express the state equations as follows: 


“y(t Ay, A t B 
earl 11 A | [6.7] 
X2 (0) 0 Anj 26) 0 
By similarly decomposing the state feedback [6.2]: 


A x, (¢) 
u)=-(K Ko) |) + O [6.8] 


X2 t) 


the equation of the looped system is written: 


r : ~ - BK; Ay - a & al rn o e(t) [6.9] 
à) (0) 0 A5 X2 (t) 0 


so that, the state matrix being block-triangular, the eigenvalues of the looped system 
are the totality of eigenvalues of sub-matrices 4); - BK, and A). The 
eigenvalues of the non-controllable part are thus, by all means, eigenvalues of the 
looped system. 


Let us suppose now that the system is controllable. In this part, we will assume 
that the system has only one control; however, the result cannot be extended to the 
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case of multi-control systems. As indicated in Chapter 2, the equations of state can 
be expressed in companion form: 


0 1 0 se 0 0 
x(t) 0 0 1 Way 46 x(t) 
: |=| 0 0 0  ‘ 0 > J+lolu@© [6.10] 
x, (©) : : e “tT x,(t) 0 
An ~ 4-1 ~4n-2 | =A 1 


By writing the state feedback [6.2] as: 


xı (t) 
u(t)=—(ky kyi = kK) i lt et) [6.11] 
Xn (6) 


the equation of the looped system remains in companion form: 


3 xı (£) . 
= ; A OM : 1+] "Jeo [6.12] 
xp (t) 


x (¢) 


+ . 
Xn -k ce. -a-k 


so that the characteristic polynomial of the looped system is written: 
det (AI - (A- BK)) = A" + (a \+k 1) Ar Een ae ntk n) [6.13] 


We see that, by choosing the state feedback coefficients, it is possible to 
arbitrarily set each characteristic polynomial coefficient so that we can arbitrarily set 
its roots, which are precisely the eigenvalues of the looped system. In addition, 
matrix K is thus uniquely determined. 


Theorem 6.1 thus shows that it is possible to stabilize a controllable system 
through a state feedback (it is sufficient to take all A; with a negative real part in 
continuous-time, inside the unit circle in discrete-time). More generally, it shows 
that the dynamics of a controllable system can be randomly set for a linear state 
feedback. 
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However, in this chapter we will not deal with the practical issue of choosing 
the eigenvalues. Similarly, we note that for a multi-variable system (1.e. a system 
with several controls), the choice of eigenvalues is not enough in order to uniquely 
set matrix K. Degrees of freedom are also possible for the choice of the eigenvectors 
of matrices A-B K or F -G K . Chapter 14 will tackle these aspects in detail. 


6.2.2. Example 


Let us consider the system described by the following equations of state: 


x(t) = 0 1 xı (t) 0 
- à 7 é a e J i ao 


O [6.14] 
X1 t 
y(t) = (1 o | 
x2(0) 
We can verify that this system is controllable: 
0 1 
rank (B 4B) = nn ) =2 [6.15] 
We obtain, with K = (ki k): 
=] 
det (A I — (A-BK)) = =4°+(1+k)A+k 6.16 
et ( ( )) E Ar (1+ ky) 1 [6.16] 








and by identifying with a second order polynomial written in the normalized form: 


ky = Wo" 


[6.17] 
kz = 2E wo -1 


2 +0+k)A+k = À +26 WM, À + Wy < | 


Figure 6.2 shows the evolution of the output and the control, in response to the 
initial condition x(0) = (1 D , for different values of Wg and & the higher wo is, 
the faster the output returns to 0, but at the expense of a stronger control, whereas 
the increase of € leads to better dynamics. 
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Figure 6.2. Stabilization by pole placement 


6.3. Reconstruction of state and observers 
6.3.1. General principles 


The disadvantage of state feedback controls, like the ones mentioned in the 
previous chapter, is that in practice we do not always measure all the components of 
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state vectorx. In this case, we can build a dynamic system called observer, whose 
role is to rebuild the state from the information available, i.e. the controls u and all the 
available measures. The latter will be grouped together into a z vector (Figure 6.3). 


u System (x) Z 3 
| 





Figure 6.3. The role of an observer 


6.3.2. Continuous-time observer 


Let us suppose that the equations of state are written: 

x(t) = Ax(t) + Bult 

(t) (0) (t) [6.18] 
z(t) = C x(t) 


The equations of a continuous-time observer, whose state is marked X(f), are 
calculated on those of the system, but with a supplementary term: 


ome X(t) + B ult) + L (2(t)- (6) [6.19] 


LD =C x(t) 


The observer equation of state includes a term proportional to the difference 
between the real measures z(t) and the reconstructions of measures obtained from the 
observer’s state, with an Z gain matrix. In the case of a system with n state variables 
and q measures (i.e. dim( x ) = dim( x)= n, dim(z)= q), Lisan nxq matrix. 


Equations [6.19] correspond to the diagram in Figure 6.4: in the lower part of the 
figure we see equations [6.18] of the system we are dealing with. The failure term 


with the Z gain matrix completes the diagram. 


Hence, equations [6.19] can be written as follows: 
iO = (A-LC) X(t) + Bult) + L z(t) [6.20] 


which makes the observer look like a state system x(t), with the inputs u(t) and 
z(t) and with the state matrix Æ- LC. We infer that the observer is a stable 
system if and only if all the eigenvalues of A — LC are strictly negative real parts. 
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Figure 6.4. Structure of the observer 


Let us now consider the reconstruction error €(t) that appears between x(t) 
and x(t). Based on [6.18] and [6.19], we obtain: 


é=X-% =(Ax + Bu) —- (A+ Bu+LCx-LCÀ) 
lt) = (A-LC) é(t) [6.21] 


and hence the reconstruction error €(t) tends toward 0 when f tends toward infinity 
if and only if the observer is stable. In addition, the eigenvalues of A — LC set the 
dynamics of e(t). Hence, the problem is to determine an L gain matrix ensuring 
stability with a satisfactory dynamics. 


6.3.3. Discrete-time observer 


The same principles are applied for the synthesis of a discrete-time observer; if 
we seek to rebuild the state of a sampled system described by: 


Xp4, =F x, + Gu 
| k+l k k [6.22] 
Zk = C xy 
the observer’s equations can be written in the two following forms: 
Ery FF À + G u + Lg 2 
de "k k (z= 2) [6.23] 
ZE C Xk 


Xk44 = (F -LC) x, a Gu, F Lz [6.24] 
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From equations [6.22] and [6.23] we infer that the reconstruction error verifies: 
Ek+1 = (FLO) € [6.25] 


In order to guarantee the stability of the observer and, similarly, the convergence 
toward 0 of error €}, matrix L must be chosen so that all the eigenvalues of 
F-LC have a module strictly less than 1. 


According to [6.23] or [6.24], we note that the observer operates as a predictor: 
based on the information known at instant k, we infer an estimation of the state at 
instant k +1. Hence, this calculation does not need to be supposed infinitely fast 
because it is enough that its result is available during the next sampling instant. 


6.3.4. Calculation of the observer by pole placement 


We note the analogy between the calculation of an observer and the calculation 
of a state feedback, discussed in section 6.1: at that time, the idea was to determine a 
K gain matrix that would guarantee to the looped system a satisfactory dynamics, the 
latter being set by the eigenvalues of 4 - BK (or F — GK for discrete-time). The 
difference is in the fact that the matrix to determine appears on the right in product 
BK (or GK ), whereas it appears on the left in product LC. 


However, the eigenvalues of A-LC are the same as the ones of A’—C’L’, 
expression in which the matrix to determine L’ appears on the right. Choosing the 
eigenvalues of A’—C’L’ is thus exactly a problem of stabilization by pole placement: 
the results listed in section 6.1 can thus be applied here by replacing matrices A and 
B (or F and G) by AT and C7 (or F T and C’) and the state feedback K by L’. 


Based on Theorem 6.1, we infer that matrix L exists for any set of eigenvalues 
{A,,---,4,} chosen a priori if and only if (AT CT ) is controllable. However, we 
can write the following equivalences: 


(AT, CT) controllable rank[CT AT CT (47 cl jen © 


C 


CA 
rank =n & (C, A) observable [6.26] 


C an7! 
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Hence, we can arbitrarily choose the eigenvalues of the observer if and only if 
the system is observable through the measures available. Naturally, the result 
obtained from equation [6.26] can be used for the discrete-time case by simply 
replacing matrix A with matrix F. 


6.3.5. Behavior of the observer outside the ideal case 


The results of sections 6.3.2 and 6.3.3, even if interesting, describe an ideal case 
which will never be achievable in practice. Let us suppose, for example, that a 
disturbance p(t) is applied on the system [6.18]: 


iC = A x(t) + Bu(t) + E p(t) [6.27] 


z(t) = C x(t) 


but observer [6.19] is not aware of it and then a calculation identical to the one in 
section 6.3.2 shows that the equation obtained for the reconstruction error can be 
written: 


&(t) = (A-LC) €(t) + E p(t) [6.28] 


so that the error does not tend any longer toward 0. If p(t) can be associated with a 
noise, Kalman filtering techniques can be used in order to minimize the variance of 
e(t). We provide a preview of this aspect in section 6.5.3. 


If we suppose that modeling uncertainties affect the state matrix of system 
[6.18], so that a matrix A'# A intervenes in this equation, then the reconstruction 
error is governed by the following equation: 


E(t) = (A-L OC) Eelt) + (A-A xlt) [6.29] 
so that there again the error does not tend toward 0. 


NOTE 6.1.— observers [6.19] or [6.23] rebuild all state variables, operation that may 
seem superfluous if the measures available are of very good quality (especially if the 
measurement noises are negligible): from the moment the observation equation 
already provides q linear combinations (that we will suppose independent) of state 
variables, it is sufficient to reconstitute n — q , independent from the previous ones. 
Therefore, we can synthesize a reduced observer, following an approach similar to 
the one presented in these sections (see [FAU 84, LAR 96]). However, the physical 
interpretation underlined in section 6.3.2, where the observer appears naturally as a 
physical model of the system completed by a retiming term, is lost. 
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6.3.6. Example 


Let us consider again the system described by equations [6.14]: 


x(t)\ (0 x1 (6) 
oll 7 aa (e n 


y(t) = (1 0 0 | 


We can verify that this system is observable: 


rank = rank =2 [6.30] 
CA 0 1 


The observer’s equations can be written, by noting L = (h4 bL) Fe 


X (2) x(t) 

(A?) [ = r (e osli Joo- £(0) 

(74 1 0O 

if SE fi) oh | xO [6.31] 


The characteristic polynomial of the observer is written: 


A+ -1 


D el [=ar A+ (h +h) [6.32] 


det (AI - (A-L o=] 


and by identifying with a second order polynomial written in normalized form: 


A2? +(+h)A +h +h) = P+2EMAt+a0) © 


l = 2E wo -1 
[6.33] 
l = MQ -2 é a +1 
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Figure 6.5. Observer by pole placement 


Figure 6.5 shows the evolution of the two state variables in response to the 
initial condition x(0) = (1 D? and the evolutions of the state variables of the 
observer initialized by x(0) = (0 oy for different values of Wg: the higher wọ is, 
the faster the observer’s state joins the systems’ state. 
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6.4. Stabilization through quadratic optimization 
6.4.1. General results for continuous-time 
Let us consider again system [6.1], with an initial condition x(0) + 0. The 


question now is to determine the control that enables to bring back x(t) state to 0, 
while minimizing the criterion: 


J = GOT Ox(t) + ut)’ Rud) dt [6.34] 


where Q and R are two symmetric matrices, one positive semi-defined and the 
other one positive defined: 


O=0'>0 , R=R' >0 [6.35] 


(hence, we have x? Ox>0 Vx and u/Ru>0 Vu# 0). Since matrix Q is 
symmetric, we will write it in the form Q = HTH, where H is a full rank 
rectangular matrix. 


The solution of the problem is provided by Theorem 6.2. 


THEOREM 6.2.- if conditions [6.35] are verified, and also if. 


(A, B) is stabilizable 
[6.36] 


(H, A) is detectable 


there is a unique, symmetric and positive semi-defined matrix P, which is the 
solution of the following equation (called Riccati’s equation): 


PA+A'P-PBR'B'P+Q=0 [6.37] 


The control that minimizes criterion [6.34] is given by: 


[6.38] 


u(t)=-K x(t) 
K=R BP 
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It guarantees the asymptotic stability of the looped system: 


x(t) = (A4-BK)x(t) is such that: Vx(0) x(t) — 0 [6.39] 
t—oo 


The value obtained for the criterion is then J’ = x(0)7 P x(0). E 


Elements of demonstration 


The condition of stabilizability of (4, B) is clearly a necessary condition for the 
existence of a control that stabilizes the system. We will admit that it is also a 
sufficient condition for the existence of a matrix P, symmetric and positive semi- 
defined, solution of Riccati’s equation [MOL 77]. If, moreover, (H, À) is detectable, 
we show that this matrix is unique [ZHO 96]. 


If (A,B) is stabilizable, we are sure that there is a control for which J (whose 
upper bound is infinite) acquires a finite value: since the non-controllable part is 
stable, any state feedback placing all the poles of the controllable part in the left 
half-plane ensures that x(t) and u(t) are expressed as the sum of the exponential 
functions that tend toward 0. 


Mutually, any control u(t) leading to a finite value of J ensures that 
x(t)? Q x(t) tends toward 0, and hence that H x(t) tends toward 0. Since (H, A) is 


detectable, this condition ensures that x(t) tends toward 0. 


Hence, let us define the function V(x(t)) = x(t)? P x(t) where P is the positive 
semi-defined solution of [6.37]. We obtain: 


<= (Ax + Bu)! Px + xT P(A x + Bu)= 
t 


= x (ATP + PA)x + ul BlPx + x! PBu= 


Il 


x’ (PBR BTP - Q)x + u’ BTPx + x’ PBu= 


(ul + x! PBR") Ritu + RIB! Px) =e RY = x’ Ox= 


= (u-u)! R(u-u ) à u! RUE x Ox 
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by noting u” the control given by [6.38]. For any stabilizable control u(t) we have: 


J = ” (x? Ox + u Ru)dt = 
0 


-peZ (u-u )" R(u-u))dt = 
t 


Il 


x(0)7 Px(0) + f° (u-u) R(u-w")) dt 


Since R is positive defined, J is minimal for u(t) = u(t) and thus has the 
announced value. As indicated in the third section, the detectability of (H,A) 
ensures the asymptotic stability of the looped system. 


NOTE 6.2.— when P>0, function V(x(t)), which is then positive defined and 
whose derivative is negative defined, is a Lyapunov function (condition P > 0 is 
verified if and only if (H, A) is observable [MOL 77]). 


6.4.2. General results for discrete-time 


The results enabling the discrete-time quadratic optimization are the same, with a 
few changes in the equations describing the solution. Let us consider the system 
[6.4] and the criterion to minimize: 


J = D Or Q Xe + uk Rug) [6.40] 
k=0 


matrices Q and R having the same properties as in the previous section 
(particularly with the conditions [6.35]). The solution of the problem is provided by 
Theorem 6.3 [KWA 72]. 


THEOREM 6.3.— if conditions [6.35] are verified and also if: 


fe G) is stabilizable [6.41] 


(H,F) is detectable 
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there is a unique matrix P, symmetric and positive semi-defined, solution of the 
following equation (called discrete Riccati’s equation): 


FTPF - P -F’PG(R + G'’PG)'G'PF + Q=0 [6.42] 


The control that minimizes criterion [6.40] is given by: 


ug = -K Xk 
x 7. [6.43] 
K=(R + G'PG)'G'PF 
It guarantees the asymptotic stability of the looped system: 
Xpay =(F -GK)x, is such that: Vxg x, — 0 [6.44] 
k— œ 
The value obtained for the criterion is then J = x(0)7 P x(0). E 


6.4.3. Interpretation of the results 


The results presented above require the following notes: 


— the optimization of a criterion of the form [6.34] or [6.40] does not have to be 
considered as a goal in itself but as a particular means to calculate a control, which 
has the advantage of leading to a linear state feedback; 


— however, we can attempt to give a physical significance to this criterion: it 
creates a balance between the objective (we want to make x return to 0, the 
evolution of x penalizes the criterion through matrix Q ) and the necessary expense 
(the controls u applied penalize the criterion due to matrix R ); 


— the choice of weighting matrices Q and R depends on the user, as long as 
conditions [6.35] and [6.36] or [6.41] are satisfied. Without getting into details, it 
should be noted that if all coefficients of Q increase, the evolution of x is even 
more penalized, at the expense of the evolution of u controls; thus the optimization 
of the criterion leads to a solution ensuring a faster dynamic behavior for the looped 
system, but at the expense of stronger controls. Inversely, the increase of all 
coefficients of R will lead to softer controls and to a slower dynamic behavior; 


— the two conditions in [6.36] or [6.41] are not of the same type: in fact we can 
always fulfill the condition of detectability by a careful choice of matrix Q. 
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However, the available controls impose matrix B (or G), so that there is no way of 
acting on the condition of stabilizability; 


—the criterion optimization provides only matrix K of expression [6.38] or 
[6.43]. In the absence of input (e = 0), the control ensures the convergence towards 
the state of equilibrium x = 0. Input e makes the system evolve (in particular a 
constant input makes it possible to orient the system toward another point of 
equilibrium, different from x = 0). 


6.4.4. Example 


Let us consider system [6.14] and the criterion: 
_ pe 2 2 _fg 0 = 
J= b (q v(t)” +ru(t)’)dt and Q = 0 0 and R=r [6.45] 


where q and r are positive coefficients. Hence, we have H = (Ja 0) and we can 
verify that (H, 4) is observable : 


rank Jen va 7 =2 [6.46] 
HA 0 a 


In section 6.2.2 we saw that (4, B) is controllable, so that hypotheses [6.36] are 
verified. The positive semi-defined solution of Riccati’s equation and the state 
feedback matrix are written by noting a@=./qgr and B= q/r: 


p-f 11+2B a | 
l a r(-1+/1+2B) 


K=(B -1+41+2ß) 


We note that the latter depends only on the ratio g/r and not on q and r 
separately. Figure 6.6 shows the evolution of the control and the output, in response 
to the initial condition x(0) = (1 DT, for different values of q/r: the higher g/r is, 
the faster the output returns to 0, but at the expense of a stronger control. 


[6.47] 
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Figure 6.6. Stabilization by quadratic optimization 
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6.5. Resolution of the state reconstruction problem by duality of the quadratic 
optimization 


6.5.1. Calculation of a continuous-time observer 


The calculation of an observer (section 6.3) must ensure the convergence toward 
0 of the reconstruction error €(t) , described by: 


ët)=(A- LC) &(t) [6.48] 


or, similarly, that the eigenvalues of A—ZLC are all of negative real parts. 
However, in section 6.3.4 we saw that the calculation of an observer implies the 
calculation of a state feedback when we transpose matrices A and C of the system. 
In other words, if we define the following imaginary system: 


W(t) = ATNA + Cv) [6.49] 
with a state feedback, noted by v(t) = — i 7(t) , we obtain the looped system: 
it) = (AT - CTL! yt) [6.50] 


whose state matrix has the same eigenvalues as the observer [6.20]. Hence, there is 
equivalence between the stability of the looped system [6.50] and the stability of the 
observer. 


In order to calculate matrix LT , we can use the quadratic optimization approach 
presented in section 6.4. Let us define for system [6.49] a quadratic criterion: 


J = GO TV NA + vO" W vo) dt [6.51] 


where V and W are two symmetric matrices — the first is semi-positive and the 
second is positive definite: 


V=VT =ss' >0,W=wW' >0 [6.52] 


By applying Theorem 6.2, and by using the duality between stabilizability and 
detectability, we immediately obtain the following result. 
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THEOREM 6.4.- if conditions [6.52] are verified and also if. 


C,A)  detectabl 
(4° ,C*) stabilizable { ) detectable [6.53] 


UT,AT) detectable (A,J) stabilizable 


there is a unique matrix M, which is symmetric and positive semi-defined, solution 
of Riccati’s equation: 


MA'+AM-MC'W'!CM+V=0 [6.54] 
The gain matrix: 
I? =w'cmMm e L=MC hw"! [6.55] 


guarantees the asymptotic stability of the observer. E 


We should also remember that the eigenvalues of À — LC set the dynamics of 
the observer and of the reconstruction error £(f). 


Hence, determining the observer depends only on the choice of the two new 
weighting matrices V and W. Like matrices Q and R of the problem of stabilization 
by quadratic optimization (section 6.4), their choice makes it possible to adjust the 
dynamics of the observer. It should particularly be noted that the increase of 
coefficients of V (W respectively) leads to a faster dynamics (slower, respectively). 


6.5.2. Calculation of a discrete-time observer 


The same approach is applicable for a discrete-time observer, for the synthesis of 
an observer described by equation [6.23] or [6.24]: by using the results in section 
6.4.2, we obtain the results below. 


THEOREM 6.5.— if conditions [6.52] are verified and also if: 


(C,F) detectable 
[6.56] 


(F,J) stabilizable 
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there is a unique matrix M, symmetric and positive semi-defined, solution of the 
Riccati’s discrete equation): 


FMF!’ -M-FMC'(W+CMC')'CMF' +V =0 [6.57] 
The gain matrix: 
L=FMCT(W+cMcT)! [6.58] 


guarantees the asymptotic stability of the observer. = 
As for continuous-time, determining the observer depends only on the two 
matrices V and W: they set the eigenvalues of F-LC, on which depend the 
dynamics of the observer and the reconstruction error €}. 
6.5.3. Interpretation in a stochastic context 
In this section, we will give a short preview on the techniques of state 
reconstruction that can be used in a stochastic context!. It is interesting to realize 


that, in a certain measure, the results are obtained by quadratic optimization. 


The system whose state we seek to rebuild is supposed to be described by: 


Aa = A x(t) + B u(t) + v(t) [6.59] 

z(t) = C x(t) + w(t) 

where v(t) and w(t) are white noises, of zero average and of variances: 
Evvit)vt)'}=V ; Efw()wa)'}=w [6.60] 


Noise v(t) can be interpreted as a disturbance occurring at the system input and 
w(t) as a measurement noise. The problem is to rebuild the state of the system 
through an observer of the form [6.19]. 


1 More complete developments are proposed, for example, in [LAR 96]. 
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[KWA 72] shows that the observer that ensures an average zero error 
E {x(t) — O} and that optimizes the variance: 


Ln) = ELOA) ; eO = x) - 80) [6.61] 
is given by the following equations: 


(= A X(t) + B u(t) + LA ZA- CA) 
£(0) = E{x(0)} [6.62] 
Li)= X(t) Cw! 


where X(t) verifies Riccati’s differential equation: 


X(t) = X(t) AP + AX) = EH) CTW CX) +V een 
(0) = E{(x(0) — £(0)) (x(0) - %(0))"} 


This observer is called Kalman’s filter and we note that its gain varies in time. 
However, we have the following convergence result [K WA 72]: 


THEOREM 6.6— if conditions [6.56] are verified, X(t) solution of [6.63] tends, 
when t—>œ, toward the unique positive semi-defined symmetric solution M of 
Riccati’s algebraic equation [6.54]. E 


Hence, we can interpret the observer determined in section 6.5.1 as the 
permanent state of Kalman’s filter that optimizes the state reconstruction, 
considering the particular hypotheses on the noises that interfere on the system. 


The same results are obtained for discrete-time [KWA 72], if we assume the 
system described by: 


ee =F x, + Gu, + VE | E{vk v }=V [6.64] 


Zk =C xk + Wy Etw, we }=W 


Under conditions [6.56], the permanent state of Kalman’s filter is there also the 
observer determined in section 6.5.2. 
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6.5.4. Example 


Let us consider again the system described by equations [6.14]: 


x, (0) _ 0 1 xı (t) à. 0 u(t) 
#0) lo -ila U 


w@ = (1 ofzo] 


x2 (6) 


and let us calculate an observer with the weighting matrices: 


V = ie s d W= 6.65 
= Ons or me an = W [6. ] 


where v and w are positive coefficients. We can verify that (4, J ) is controllable: 


rank (J AJ)= rank ($ s) =2 


[6.66] 


In section 6.3.6 we saw that (C, 4) is observable, so that hypotheses [6.53] are 
verified. The equations of the observer are the general equations [6.19], with Z 
solution of equations [6.54] and [6.55]. 


Figure 6.7 shows the evolution of the two state variables in response to the initial 
condition x(0) = (1 D? and those of the state variables of the observer initialized 
by x(0) = (0 oy, for different values of the ratio v/w: the higher v/w is, the faster 
the observer’s state returns to the state of the system. 
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Figure 6.7. Observer by quadratic optimization 
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6.6. Control through state feedback and observers 
6.6.1. Implementation of the control 


The results of the previous sections enable to determine a control law for a 
system whose state is not entirely measured. We suppose that its equations are: 


ee = Ax(t) + Bu(t) [6.67] 


z(t) = C x(t) 


The modal control, or the optimization of a quadratic criterion, provides a state 
feedback control, whose general form is the following: 


u(t) = -K x(t) + e(t) [6.68] 


Similarly, the modal approach, or the choice of two weighting matrices V and 
W provides an L gain observer. 


The system control [6.67] is obtained by implementing the state feedback not 
from the state x(t) of the system, which is not accessible, but from its 
reconstruction x(t) provided by the observer. Hence, it is given by the following 
equations, which correspond to the diagram in Figure 6.82: 


| R(t) = A(t) + Bult) + L(z(t)- CR) [6.69] 


u(t) = —K x(t)+ e(t) 





a 

















Figure 6.8. Control by state feedback and observer 





2 The name LQG (that stands for Linear-Quadratic-Gaussian) control is sometimes used to 
designate this type of control. It obviously refers to one of the methods used for calculating 
the state feedback, and to the stochastic interpretation of the reconstruction carried out by the 
observer (section 6.5.3). 
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In the case of a sampled system, described by the equations: 


o +Guk [6.70] 


Zk = Cx, 


the control law obtained by implementing the state feedback from its reconstruction 
x; provided by the observer (following here too the principle in Figure 6.8) is given 
by: 


fee Xp+G up +L (zx —C Xz) [6.71] 


uk =-K k + ex 


6.6.2. Dynamics of the looped system 


Let us briefly discuss now the equations of state of system [6.67] looped by the 
control law [6.69]. For that we need 2n state variables because the system, as the 
observer, is of order n . Let us choose as state vector of the group the set of vectors 
x and € together. We already know equation [6.21] describing e(t). For x(t) we 
obtain equation [6.72]: 


k(t) = Ax(t)+ B(-K X(t) + e(t)) = Ax(t)+ B(-K x(t)+ K Elt) + elt) 
x(t) = (4-BK) x(t) + BKe(t) + Be(t) [6.72] 
By considering both equations [6.72] and [6.21], we thus obtain: 
“ -BK BK le e 
| = + e(t) [6.73] 
€(t) 0 A- LC )\ e(t) 0 
In the case of a sampled system, we similarly obtain: 
SO" erell) 
= +| Jez [6.74] 
Ek+1 0 F-LC )\ €k 0 


The block-diagonal structure of state matrices obtained makes it possible to state 
that the looped system has as eigenvalues the reunion of the eigenvalues of 4-BK 


> 





= 
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and of A-LC (or F-GK and F-LC in discrete-time). The former have been 
chosen or set during the calculation of the state feedback (and in this case we have 
control dynamics) whereas the latter have been chosen or set during the calculation 
of the observer (and in this case we talk of reconstruction dynamics). 


6.6.3. Interest and limitations of this result 


The previous example justifies the approach adopted which consists of 
calculating the state feedback and the observer independently. 


In addition, if the observer operates in perfect conditions, it theoretically 
maintains e(t) at 0, so that equations [6.73] and [6.74] become identical to 
equations [6.3] and [6.6] that we obtain if we can directly apply the state feedback! 


This result must be analyzed critically. First of all it implies that the observer and 
the system operate exactly in the same conditions. However, we saw in section 6.3.5 
that in the presence of an interference applied to the system, error e(t) does not tend 
toward 0 anymore. The separation of dynamics remains, however, verified in this 
case. 


On the other hand, let us suppose that modeling uncertainties affect the system 
state matrix. In section 6.3.5 we established equation [6.29] that governs the 
reconstruction error. The equation of the looped system becomes: 


ta 7 o BK BK F? i NT [6.75] 
è(t) Ž-A A-LC) eW 0 


This time we see that all the system’s dynamics are affected and that we cannot 
state anything with respect to the position of its eigenvalues. We must also add that, 
in certain cases, even a very small error between the two matrices can be enough to 
change the eigenvalues significantly. 


This last point shows that it is indispensable to verify that the control is 
sufficiently robust to face the model errors. A detailed analysis of the robustness 
properties is outside the context of this work; for more details on this aspect, see 
[DUC 99]. A first approach consists of ensuring that the stability margins of the 
looped system are sufficiently high. In what follows we will present this aspect. 
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6.6.4. Interpretation in the form of equivalent corrector 


The state feedback return and the observer are illustrated in Figure 6.8 and given 
in continuous-time by equations [6.69]. If, in order to simplify, we note e(t) = 0, 
equations [6.69] are written in the following form, which is obtained by carrying the 
expression of u(t) in the first equation: 


es = (4- BK -LC)i(t) + L z(t) re 


u(t) = -K X(t) 


Equations [6.76] correspond to a system with input z(t) and output u(t), Le. 
what we usually call a corrector. By using the Laplace transform, let us calculate the 
transfer functions of the system and corrector. From equations [6.67] describing the 
system, we obtain: 


Z(p)= G(p)U(p) with G(p)=C(p1-A)'B [6.77] 
and from equations [6.76] describing the corrector: 
U(p)=-K(p)Z(p) with K(p)=K(pI-A+BK+LC)'L [6.78] 


The looped system can thus be represented by the negative feedback loop in 
Figure 6.9. The corresponding open loop transfer function is written: 


To (p) = K(p) G(p) = K (pI- A+ BK+LC)'!L C(p1-A)!B [6.79] 





Corrector 


Figure 6.9. Analysis of stability margins 
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If the system has only one control, Tgo(p) is a scalar transfer function, and its 
stability margins can be determined from its frequency response Tgo(jœ), traced 
in Bode, Black or Nyquist’s planes. This analysis enables to verify that the control, 
which was determined by adopting a purely algebraic approach, leads to satisfactory 
stability margins. 


The same approach is applied for a discrete-time system. The equations obtained 
for the state feedback control and the observer are written, by considering e =0: 


oe K,tL % [6.80] 


Ux =-K k 


The application of the z-transform to equations [6.70] and [6.80] makes it possible 
to obtain transfer functions G(z) and K(z) of the system and of the corrector, then 
the open loop transfer function: 


Tpo(Z)=K(2) G()=K (zI-F+GL+KCY!K C(z1-F)!G [6.81] 


Hence, determining the stability margins can be done from the frequency 
response Tgole™T) in Bode, Black or Nyquist planes. 


6.6.5. Example 


Let us consider again system [6.14] which was our example throughout this 
chapter. We will use it again to illustrate what follows: 


—the state feedback calculated in section 6.2.2 by the modal approach, with 
Wo = 2 and é = 0.7; 


— the observer calculated in section 6.3.6, with ©, = 4 and & = 0.7. 


Figure 6.10 shows the responses to an initial condition x(0) = (1 D7 with state 
feedback and observer: the initial condition of the observer is x(0) = (1 D (top of 
the figure) and then x(0) = (0 oy (bottom of the figure). In the first case, the 
responses are not clear because, as the observer has been set on the system since its 
origin, only the control dynamic appeared; in the second case, the initial error is 
non-zero at { = 0, and the two dynamics of control and reconstruction intervene 
when the observer is used. 





3 The approach is extended to the case of multi-control systems at the expense of additional 
developments (see [DOR 95, FRI 86]). 
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Figure 6.10. Control by the observer, compared to control by state return 
Finally, Figure 6.11 shows Bode’s diagram of Tgo(j œ), from where we can 


obtain gain and phase margins of 12.9 dB and 50.7°. These values can be considered 
as satisfactory. 
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Figure 6.11. Determination of stability margins 


6.7. A few words on the resolution of Riccati’s equations 


In this chapter Riccati’s equations occurred several times, corresponding to 
continuous-time and discrete-time problems. These equations appear in numerous 
control engineering problems and we will finish this chapter by giving a preview on 
a few explicit methods of resolution. 


Let us consider firstly Riccati’s continuous equation: 
KALA XAOS XK +020 [6.82] 


where A, O= oF ,S= ST and X =X" are nxn real square matrices and X is 


the unknown factor. Neither the existence nor the uniqueness of the solution is 
guaranteed in the general case. In automatic control engineering, the main interest 
lies in the so-called “stabilizing” solution (we will see that it is unique), for which all 
the eigenvalues of 4 —- S X are of strictly negative real part. 
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Firstly, we will consider the 2nx2n Hamiltonian matrix, associated with 
equation [6.82]: 


A -S 
H=| osa [6.83] 
We easily verify that: 


CESR I ai 


and thus that H is similar to — HT : matrix H therefore has a symmetric spectrum 
with respect to the origin. As this is a real matrix, its spectrum is symmetric with 
respect to the real axis and hence it is also symmetric with respect to the imaginary 
axis. Furthermore, it is necessary to make the following hypothesis: matrix H does 
not have a complex eigenvalues. 


Hence, we can calculate Jordan’s form of H , which makes it possible to write: 


T, T, T, T, -A 0 
a(t m- HA 0) ane 
Ta Th Ta Taj 0 A 


where A and —A contain eigenvalues with strictly positive and strictly negative 
real parts respectively. Let us introduce the second hypothesis: matrix 7j, is 
reversible. 


These two hypotheses are especially verified in the problems discussed in 


sections 6.4.1 and 6.5.1 [ZHO 96]. Then, the stabilizing solution of equation [6.82] 
is: 


TT [6.86] 
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Demonstration. It is sufficient to formulate the terms of equation [6.85] where 7j; 
and T>; intervene: 


fie -S Tzi =-7T,, A 
T 
=O] p= A Ty = 197 A 


[6.87] 


-A 
A-SX =-T,ATy 
-Q- ATX =-Ty AT 


XA-XSX=-h AT =-0-AX 


In addition, the first equation of the second brace shows that A — S X has all its 
eigenvalues as strictly negative real parts. 


Numeric difficulties may occur when matrix H has multiple or very close 
eigenvalues, or even close to the imaginary axis. To improve this aspect, we can use 
Schur’s form [BIT 91], by replacing equation [6.85] by factoring: 


T 
U U S S U U 

a-( 11 Al 11 all 11 | [6.88] 
Un, Ur) 0 So2)\U2, Un 


where matrix U is orthogonal and matrices S;, and Sp, are in Schur’s form (i.e. 
superior quasi-triangular, the real eigenvalues of H appear on the diagonal and the 
complex eigenvalues as 2 x 2 blocks), with block S$}, corresponding to the 
eigenvalues with negative real part. Hence, we have: 


Xa UG 0A [6.89] 
In the case of Riccati’s discrete equation: 
FY F-X-F'XG(R+G'XG)'GIXYF+O0=0 [6.90] 


where all matrices are real, F, Q = o7 and X = X7 are of size nxn , R= RT is 
of size mxm, G is of size nxm and X is the unknown factor. The stabilizing 
solution, if it exists, is the one for which matrix F -G(R + GTX Gye x F has 
all its eigenvalues of the module strictly less than 1. 
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The approach is a simple transposition of the continuous-time case [BIT 91, 
VAU 70]. We must assume that matrix F is reversible, in which case equations 
[6.86] and [6.89] can be applied, by using instead of [6.83] the matrix: 


AAR eet A oo Beer aT 
a [Pro Gro -GR'G'F | eo 


of tO ET 


where F~! is the transpose of F “| This matrix has eigenvalues that are opposite 
to each other, so that Jordan’s form which makes it possible to write: 


T, T, T, T, vl 
al 11 mal 11 3 A 0 [6.92] 
Ty, Tr Ti Taj 0 A 


where A and A! contain the eigenvalues of the module more than 1 and less than 
1 respectively. Schur’s form, on the other hand, is always provided by [6.88], where 
block S}; corresponds to the eigenvalues of the module less than 1. 


6.8. Conclusion 


In this chapter, we have presented the necessary prerequisites for the creation of 
controls using the state representation. We saw that by using either an approach by 
pole placement, or an approach by quadratic optimization, it was possible to 
calculate a state feedback, then to rebuild the state and finally to obtain a control 
from the available measures. 


However, it should be clear that this chapter is left incomplete on purpose: the 
results presented are for the moment unusable for the majority of practical problems 
because they do not deal either with the pursuit of indications or with the 
disturbance rejection, and that the robustness aspects are only briefly touched upon. 
The resolution of such problems and the resulting procedures will be in fact 
developed later on. 
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Chapter 7 


Process Modeling 


7.1. Introduction 


Obtaining a model of the industrial system to automate is the first task of a control 
engineer — and not the smallest one, as the quality of his work significantly depends 
on the adequacy between the model and the procedure. 


To begin, we note that there are several points of view on a complex physical pro- 
cedure. By complex, we mean a procedure with many variables and/or a procedure in 
which the phenomena involved and the interactions between the variables, are compli- 
cated. There is no universal model; its design depends entirely on the task for which it 
will be used. 


Certain models pertain to the representation of physical components of installa- 
tion and to their connections (structural models). They can be described by diagrams 
called PI (piping-instrumentation) which define the complete diagram of installation. 
Universal graphic symbols are used in order to facilitate the interpretation of this rep- 
resentation. Nowadays, a computer representation is also adopted: these models are 
generally a database and/or an object oriented representation and they are used, for 
example, for the maintenance of the procedure, for safety analysis, for the implemen- 
tation of block diagrams, etc. 


Other models are used for the design and dimensioning of the installation and for 
managing the various operation modes; they often refer to different functions that the 
installation must fulfill (functional models). They describe the role of each subsystem 
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in performing the roles of the procedure, in connection with a structure and behavior 
of components. They are used for the design of procedure monitoring, i.e. its very 
high level of control: start-up, stop, failure management, manual reboot procedures, 
structure changes, etc. 


Let us consider the example of heating a room. The goal of the system is to heat up 
the room. To do this, we specify several functions: generation of energy, water supply, 
water circulation. These functions are based on several components. For example, 
for water circulation, we use a heater, a pump and a control valve; for the function 
of generating energy, we use a boiler, a pump and a fuel tank. To all the functions 
enumerated above, corresponding to a normal operation, we can add a “draining” 
function which would correspond to taking the installation out of service. 


The goal of other models is to describe the behavior of the installation (behav- 
ioral models); this refers to describing the evolution of physical units during all the 
operation phases, be it from a static or dynamic point of view. 


The complete description of an installation requires the representation of continu- 
ous phenomena (main process) and of discrete aspects (discontinuous actions during 
changes in the operation mode, security actions, etc.). To date these two representa- 
tion modes have been separated; for example, under a purely continuous angle, the 
synthesis of regulation loops is described by supposing that any state space is accessi- 
ble and the production planning is represented in a purely discrete manner. However, 
nowadays, there are attempts to characterize the set in a hybrid model (combination 
of two aspects, continuous and discrete), but this path still has difficulties and is still 
the subject of research. 


The behavioral model may have different objectives. The two main objectives are: 
the simulation of the installation in order to test its behavior in different situations 
offline (different control laws that the engineer seeks to compare, research into its 
limits, training of control operators, etc.) and the design of controls to implement. It 
is not necessarily the same model that is used in these two cases: the first one often 
requires more precision than the second one. In fact, for the majority of time, the con- 
trol is calculated on a linear approximation of the system around the nominal working 
point because the majority of industrial systems work (in normal operating mode) in a 
limited range, corresponding to an optimal zone for the production. We can also use, 
in order to calculate the control, a highly simplified non-linear model, the intermedi- 
ary between these two situations being the calculation of a set of linear models for 
different working points or operating modes. However, in order to design the automa- 
tion of an installation in order to optimize the working points or train the operators, 
the model must be the most robust possible. 
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The complex model is based on a precise knowledge of physical, chemical, biolog- 
ical or other laws, describing the material phenomena governing the processes imple- 
mented in the procedure. We often speak, in this case, of a knowledge model or a 
model based on the first principles. It is thus quite naturally described in the form of 
non-linear differential equations in the dynamic case and/or in the form of algebraic 
equations in the static case. These equations describe the main laws of the physical 
world, which are in general material or energy balances. When we can reduce the 
differential equations to first degree equations (by possibly introducing intermediary 
variables), we obtain an algebraic differential state model. 


The complex model can be simplified under the hypothesis of linearity, in order 
to obtain linear differential equations from which we can move either to a state repre- 
sentation or to an input-output representation by transfer function. Then, if we want, 
we can also use the traditional methods in order to discretize these linear models, in 
order to directly calculate a discrete control. The first section provides a few exam- 
ples, which are trivial in comparison with the exhaustive task of the engineer for an 
industrial procedure, but which illustrate the methodology. 


When the objective is the development of a control on a linear model, it may be 
simpler to directly research this model. This research can be done from specific exper- 
imentations. In that case we speak of identification, rather than modeling. We obtain 
a model of representation. We know well the link between the transfer function and 
the frequency response and it is thus easy to translate the latter into a mathemati- 
cal model. However, it is basically impossible to perform a harmonic analysis on an 
industrial procedure — because it is incompatible with the production constraints — or 
with the response time of the procedure. Hence, faster means have been investigated in 
order to obtain these models from time characteristic responses; the most widely used 
is of course the unit-step response because it corresponds to a change of the working 
point of the installation, in other words to a current industrial practice. Therefore, a 
few fast graphic constructions make it possible to obtain, for a minimal cost, a transfer 
function close to the system. The second section deals with this aspect. 


It was soon clear that, in order to make the model robust for the entire range of 
operation where linearization is valid, we should use input signals with a much larger 
spectrum than the step function, in order to excite all the modes of the system. As 
such we use the identification on any input-output data (but that are full of information 
regarding the behavior of the system); in this case, only the strong numerical methods 
make it possible to extract the information contained in these data sets. These methods 
are explained in the third section. The method that will be the most developed can in 
fact be used on non-linear representations and that is why we also use it in order to 
parameterize the knowledge methods mentioned above. 
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7.2. Modeling 


The behavioral modeling of a continuous procedure described in what follows is 
based on a mathematical formalism: we search for a set of equations representing 
the system in the largest possible operating range. This is a task that may take several 
months and pertains to the multi-disciplinary teams. In fact, it requires a knowledge of 
physics, chemistry, biology, etc. in order to be able to understand the phenomena that 
the model will describe, and knowing numerical analysis in order to write the model 
equations in a form that is adapted to the numerical calculus. It is also necessary to 
have computer knowledge in order to be able to implement this calculations. 


A procedure is sufficiently complex in order to be able to describe straightaway 
its behavior by a system of equations. In order to realize a global model, we need 
to decompose the general system into simpler subsystems, through a descending 
approach, then recombine the various models into an ascending approach. This 
decomposition can be found in the methodology of software development: that is 
why we can use the same tools in order to manage these approaches (SADT, for 
example). At the level of a basic subsystem, there is no optimal methodology: is it 
necessary to start by writing the most complicated model possible — by calling upon 
the description of detailed mechanisms — and later simplify it, either because we 
have no knowledge regarding the coefficients present at this elementary level and no 
possibility of estimating them in practice, or because this model is too complicated to 
be used? Or is it necessary to start by writing a very rough model and not complicate 
it unless the simulation results obtained are too inaccurate? It is obvious that the 
model must be the result of a compromise between precision and simplicity. When 
it is established, we have to verify it: this means that we test it to make sure there is 
no physical inconsistency between its behavior and the behavior of the system, due, 
for example, to numerical problems or to wrong initial hypotheses. Then we have 
to validate it; this means testing its adequacy with the set of tasks for which it was 
designed. 


In order to initiate the modeling of a reasonably complex subsystem, we generally 
write material and/or energy balances. Therefore, it is convenient to locate the energy 
or material sources at the system’s input, those at the output (in general connected to 
another subsystem), the elements that can store or lose energy or matter and those that 
transport them. 


The bond-graphs are a graphic representation tool for energy transfers in a physical 
system, sometimes used as intermediaries between the physical description of a pro- 
cedure and the writing of equations. Through a formalism reuniting fields as various 
as mechanics, they describe electricity and hydraulics — simply because they are based 
on the description of power exchange between subsystems. The graph consists of arcs 
connecting the stress variables e or the stream variables f whose product represents 
the power. Forces, torques, tension and pressure are stress variables. Speed, flow and 
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current are stream variables. There are several main elements. The resistances dissi- 
pate energy (electric resistances, viscous friction). The capacities store energy (electric 
condenser, spring), as well as inertial elements (inductance, masses, moments of iner- 
tia). The transforming elements preserve power e1 fı = e2 f2 while imposing a fixed 
ratio between streams and input and output stresses (e1 = nez, fı = f2/n). Finally, 
the junctions are of two types (called 0 and 1) depending on whether they connect 
elements that preserve the stress and distribute the stream or the other way round. We 
will not go into further detail on this method, which is dealt with in specific works 
(see [DAU 00], for example). 


By admitting that we have conveniently traced the balance equations to write, they 
are general in the form of non-linear differential equations. They can be used as such 
in the simulation fine model, but they will not be generally linearized in order to obtain 
the control calculation model. The linearization is operated as follows. Let us assume 
that the differential equation is: 


y™ (t) = g(y (8), y(t), ..., y(t), elt), t) [7.1] 


We represent [7.1] by a set of first order differential equations; this is in reality a 
possible state representation of [7.1], which is obtained by noting: 


Yi =Y [7.2] 
__ dy 
Yo = Gr [7.3] 
_ dy 
Ys = ae [7.4] 
[7.5] 
N dyn—1 
Un = a [7.6] 
dyn 
dt = 9(Yn(t), tilt); nes ,yi(t), e(t), t) [7.7] 


If the model is represented by several differential equations whose variables are 
coupled, we will generally have: 


au = fi(yi,Y2,...,Ym: et) 


[7.8] 


dym = fus Yo, «2» Um Est) 
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We suppose that system [7.8] has a balance point Yo, Eo for which the derivatives 
are zero, i.e. which is defined by: 


yi(t) = Yio + ai(t) 
fi(Yio, Y20, - - - , Ymo, Eo, t) = 0 


Now, we try to represent the trajectory of small variations x(t) and u(t) by carrying 
[7.9] over [7.8] and by using a first order Taylor serial development, which leads to: 


AYio(t) | dui(t) _ du() 











7.10 
dt dt dt [7.10] 
dfi 
= io. bra »Y¥mo; Eo,t) F dy (Yio, aie aig Ymo, Eo, t)x1(t) 
1 
dfi 
++ =" (Yio, -s Ymo, Eo, t)u(t) [7.11] 
Hence, we find the following linear approximation: 
z df df 
ci) e p Df +]: | ul) [7.12] 
den dfa ... Af} |g dfn 
dt dyı dyn Le de 


where the state matrix is the Jacobian of the non-linear relation vector f(y,e,t). 
Therefore, we obtain a linear state representation of the non-linear system. 


Simple examples 


We will take the simple example of two cascade tanks supplied by a liquid volume 
flow rate. The tanks are the two storage elements of the matter; the incoming and 


aa 


Qs 


N2 


Figure 7.1. Cascade tanks 
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outgoing flows of the second tank link this subsystem to its environment. There are 
no losses or intermediary transport element, hence we will write two matter balance 
equations, one for each of the storage elements. 


Let Qe be the volume flow rate entering the first tank, Q,; the volume flow rate 
leaving the tank, A, its section, Ny; the water level in the tank and K the restriction 
coefficient of the output tank. The outgoing flow is proportional to the square root of 
the pressure difference Ap at the edges of the tank, which is itself linked to the level 
(law of turbulent flows). Hence, we have — if the atmospheric pressure is the reference 


pressure: 
Qu = Ki VW Ap = KVM [7.13] 


The same law describes the second tank, where, in order to simplify the notations, 
we suppose that the tanks have the same coefficient K: 


Qs = Koy Ap = Kov No [7.14] 


The mass balance of the tanks gives: 


dN; 
e — Qe = A — 7.1 
Qe — Qai 14 [7.15] 
dN2 
= = À, — 7.1 
Qu Qs2 2 dt [ 6] 


In general, we can assume that the levels are subjected to small variations with 
respect to the balance given by the working points Qeo, Nig and N20. The balance is 


defined by: 
Qeo = K1V Nio = K2V N20 [7.17] 


It should be noted, however, that this equation would be sufficient if we intended 
to size up the system, i.e. to choose the tanks (K1, K2 coefficients) according to the 
average levels and flows wanted. We write: 





NE Not 
Le [7.18] 
Nə = Noo + N2 
Qe = Qeo + Qo [7.19] 


Qs1 = Kiv Nio + ni [7.20] 


The limited development of the square root leads to: 


OUT (1 eo ) [7.21] 


2 Nio 
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and similarly: 


Qs2 Toy Nz (1 Prec ) [7.22] 


2 Noo 


Equation [7.15] thus becomes: 


ln dN: d 
Qua + Qo- KV Mo(1+ sm) =A Ae [7.23] 


dt dt 

If: 

Si = 2V Nio [7.24] 

Kı 
then: 
dn: 
— = = À 7.2 
Qo a Lae [7.25] 


Based on [7.16] and [7.22], the evolution of the level of the second tank is 
described by: 








=A 72 
a S ? dt [7:26] 
where: 
2/N. 
So = a [7.27] 
Kə 


The state representation of this system follows immediately: 








ya pu [7.28] 

o pmo o ES 

X= | AS à | X + H Qo [7.29] 
A251 A252 

y=|1 ax [7.30] 


where we will measure the two levels. The transfer function of the second level is: 


_ m(s) _ S2 
H(s) > Qo(s) 7 (1 + A1S15)(1 + A2828) [7.31] 





Let us take a second example: a direct current engine operated by an armature. Let 
R and L be the resistance and the inductance of the armature, u(t) the supply voltage, 
i(t) the armature current, e(t) the back electromotive force, y(t) the engine torque, J 
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and f the inertia and frictions of the tree rotating at a speed w(t). The electric equation 
of the armature is: 
di(t) 


u(t) = Ri(t) +L di 





+ e(t) [7.32] 
The back electromotive force is proportional to speed (linear state): 

e(t) = kyw(t) [7.33] 
The engine torque is proportional to the current: 

y(t) = kei(t) [7.34] 


Newton’s law applied to the tree engine gives us the balance of the engine and 
working torques: 
dw(t 
JEU 2 49 — folt) [7.35] 


The Laplace transform applied to this group of equations gives: 


U(s) = ky Q(s) 








J Q(s)=k 7.36 
(Js + PU) = ke 17.36] 
The transfer function of the of the engine system is: 
Q(s) ko 
H(s) = = 7.37 
ai U(s) (Js+f)(R+Ls) + kiko [7.37] 
If we choose as state vector: 
X= fel [7.38] 
dt 
we find the state representation: 
. 0 1 0 
JL JL E J LJ 
and if we measure the speed: 
y= [1 0] X [7.40] 


We know that this representation is not unique. We could have chosen as state 
vector: 


x= p [7.41] 
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which gives the state representation: 


: JT ke 0 

x=| M h| x+ file [7.42] 
m L 

y=|1 0] X [7.43] 


Obviously, these two representations have the same transfer function. 


In order to have the control the question that arises is: can the value of all these 
physical parameters intervening in these knowledge models be obtained? We can use 
the manufacturers’ documentation for small systems as the ones developed below. For 
more complex systems (like chemical or biotechnological systems) this task may be 
very complicated. That is why we determine, often directly from experimental record- 
ings, the parameters of transfer functions (the two time constants of the tanks, for 
example). The following two sections describe this approach. 


7.3. Graphic identification approached 


When the objective of modeling is the research of a (simple) linear model in view 
of the control, we can use direct methods based on the use of experimental record- 
ings. Two methods are available: the use of the harmonic response of the system or 
the analysis of time responses with specific excitations. The goal researched is, for a 
minimal cost, to obtain an input-output representation of the procedure in the form of 
a continual transfer function F(p). Let us recall that F(p) models only the dynamic 
part of the procedure. The time expressions will entail the initial conditions. 


The first approach (harmonic response of the system) is rarely conceivable because 
its implementation is often incompatible with manufacturing requirements or, more so, 
because the response time of the procedure makes recording it particularly long and 
tedious. 


The second approach is based on the recording of the system’s response to the 
given excitations. In particular, we use the recording of the unit-step response, which 
corresponds, from a practical point of view, to a change in the operating point. Hence, 
from unique data, we identify the system by determining the coefficients of a standard- 
ized transfer function with a predefined structure. It is important to point out that these 
graphic methods do not make it possible to estimate the precision of the parameters 
obtained. In addition, the quality of the model depends on the operating mode (noise 
level, instrumentation, etc.) and on the operator (in particular during the use of graphs). 
It is understood that the data must be collected in the absence of saturation and that it 
is essential to verify the non-saturation at the level of internal regulation loops (when 
they exist). Finally, the graphic techniques based on the use of the unit-step response 
and presented below suppose that the system to identify is asymptotically stable. 
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The first criterion to consider for the choice of the method is whether or not the 
final value, noted by s(oo), which corresponds to a pseudo-periodical or a period- 
ical response is exceeded. In Table 7.1, there is a classification of various methods 
presented in the remaining part of this section, as well as the standardized transfer 
function and the parameters to identify. 


7.3.1. Pseudo-periodic unit-step response 


From a balance position e(t ), s(tg ), we apply at instant to a step function AE. 
Then we obtain a unit-step response in the form of the one in Figure 7.2. This response 
presents a first exceedance Aj, a final value s(co). Therefore, we use as a model the 
transfer function: 


S(p) _ K 
E(p) 1422443 


2 
wn 


F(p) = 





which has three parameters to identify K, C, wn (see Table 7.1). The unit-step response 
of the model selected has the form: 

1 
Sm(t) = s(to) + KAE I — ne sin( 1 = Cu (t — to) + s)| 
with tan 0 = 4/1 — 62/6, for t > to. The (relative) amplitude of the first exceedance, 
noted by A; %, depends only on damping Ç. By using the graph given in Figure 7.3, 
we obtain the numeric value of ¢. The angular frequency wp is linked to the period of 
oscillations. Its numeric value is obtained by using Figure 7.4. 








s(to) 


Unit-step input e(t)/ 
response of procedure s(t) 








e(to 





Figure 7.2. Pseudo-periodic unit-step response 


Graphs to use 


The two graphs to use are parameterized by damping ¢. Their use is described 
below, based on the unit-step response in Figure 7.2. 
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Figure 7.4. Response time at 5% 
according to damping Ç 


Determining the transfer function F(p): method 


The method is divided up as follows: 

1) determining the static gain of procedure K = aloen, 

2) determining the first relative exceedance (percentage) A; % = 100 x =; 
3) with the help of Figure 7.3, determining the numeric value of ¢; 


4) obtaining the response time at 5%, noted by t,5%; 


5) by using Figure 7.4, determining the numeric value of PRE, then obtaining 
the numeric value of the angular frequency wp. 
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Table 7.1. Graphic methods 


7.3.2. Aperiodic unit-step response 


The method used for the identification of transfer function depends on the general 
features of the obtained step response. If this response does not have a horizontal 
tangent at instant t = fo, the tester will obviously choose a 15° order model. Otherwise, 
the choice will have to be between three models: a delayed 1%* order model (Broïda 
method), a 2"¢ order model (Cadwell’s method), and a model (strictly) superior to 1 
delayed or not (Strejc method). Here again, the know-how and a priori knowledge are 
very important with respect to the “quality” of the model selected. In order to simplify 
the expressions, we will choose from now on to = 0. If this is not the case, a simple 
variable change makes it possible to have this situation. 


7.3.2.1. First order model 
When the unit-step response appears similar to that in Figure 7.5, the model used 
is: 
S(p) K 


P= o TFF 





The unit-step response of the model selected has the form: 
sm (t) = 5(07)+KAE(1-—e"() for t>0 [7.44] 


Two parameters must be identified: the static gain of system K and the time con- 
stant T. 
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Figure 7.5. Unit-step response for a 1* order system 


Determining the transfer function F(p): method 

The method is divided up as follows: 
s(co)—s(07). 

AE” 

2) obtaining the time constant 7. Two approaches are conceivable depending on 
whether or not we know the output final value s(co). In both cases, we use the prop- 
erties which resulted from the mathematical expression [7.44]: 

a) determining 7 using s(oo) (see Figure 7.5): the tangent at the unit-step 
response t = 0 intersects the horizontal line s(oo) for t = 7, the response time at 
5% verifies ts% = 37 and the rise time! of the system verifies tm = 2.27, 

b) determining 7 without using s(oo). We choose two instants t and tg = 2t1. 
We have the numeric values of s(tp ), s(t1) and s(t2). By using [7.44], we obtain: 


1) determining the static gain of procedure K = 


s(ti) = s(t) + K AE(1 z e=) 


a 7.45 
s(t2) = s(2t1) = s(t) + K AE(1 — e=) [7.45] 





We suppose that x = e~(#). Then Ds) = —— = a which makes 


it possible to determine x. The numeric value of the time constant is given by T = 
— HOL Expression [7.45] then enables us to calculate K. 


Other graphic methods can be used, in particular that of the semi-logarithmic plane 
[LAR 77]. However, since its use is not immediate, we will not present it here. 


7.3.2.2. Second order model 


We will present a unit-step response given in Figure 7.6. 


1. Rise time is defined between instants tı and t2 such as s(ti) = s(t ) + 0.1 K AF, 
s(t2) = s(tz) + 0.9 K AE. 
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Figure 7.6. Aperiodic unit-step response: 
choice of second order model 


When the selected model is: 
S(p K 
Fo) = 2) = 
E(p) (1+7np)(1+7p) 


with 7, < T2, the unit-step response has the time expression: 





t 
T1 le + T2 
Ti — 72 = TZ 








Sm(t) = s(07) + KAE(1- e (a) ) for t > 0 


Determining the transfer function F(p): Cadwell method 


The Cadwell method is based on the fact that the unit-step response accepts a 
particular point P,(t,,5,) which does not depend on the ratio x = 72/7, and another 
point P) whose coordinates (tz, 52) strongly depend on this ratio. The approach to 
adopt is the following: 

1) determining the static gain of procedure K = sco) (0°). 

2) obtaining the instant t4 = 1.32(7, + 72) such that: 


sı = s(0-) +0.74{s(00) — s(07)} 
3) inferring the numeric value of Tsum = 71 + T2; 
4) obtaining on the curve the point (t2, s2) such that t2 = 0.5Tsum. Inferring the 


— s(t2)—s(07). 


value of n% = 5(55)=s(0=)? 


5) with the help of the graph in Figure 7.7, determining the value of Dr Inferring 
£ = 72/71. Calculating the numeric values of 7, and 72 with: 





Le = ite [7.46] 
ne 


1+x 
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Similar method 


This method lies on an approach similar to the previous one; it was developed by 
Strejc. The approach to adopt is summed up below: 


ied : : ~ _ s(oo)—s(07), 
1) determining the static gain of procedure K = = kF ~; 


2) obtaining the instant tı = 1.2564 (Tı + 72) such that: 
81 = s(07) + 0.72{s(00) — s(07)} 


3) obtaining the value of Tsum = T1 + T2; 
4) obtaining on the curve the point (t2, s2) such that t2 = 0.3574 Tsum. Inferring 
the value of ny = D 


5) with the help of the graph in Figure 7.8, determining the value of x = 12/7. 
Calculating the numeric values of 7; and 72 via the formulae of [7.46]. 
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Figure 7.7. Cadwell method, ny, = f(x) 
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Figure 7.8. Strejc method, ny, = f (x) 
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7.3.2.3. Model of an order superior to 1 


When the unit-step response has a form similar to that in Figure 7.9, we can use the 
Strejc method. We suppose there is a low dispersion of time constants of the system — 
which is consistent with the Strejc hypothesis Tu /Tn > 0.1, where these parameters 
are defined below. The transfer function sought is: 


Ke7trP 


MO Trp 


There are four parameters to determine: K, Tp, T, n. We set to = 0. 





S(c0) 








response of procedure s(t) 


Unit-step input e(t)/ 








to=0 à 





e(to 
Figure 7.9. Aperiodic unit-step response: choice of Strejc model 


Determining the transfer function F(p): Strejc method 


The difficulty of this method lies in the course of the tangent to the point of inflex- 
ion I(t;, si). The delay T, can be adjusted in order to compensate the error due to the 
imprecision on the position of J. We do not present here the use of the graph making 
it possible to determine a fraction order n [DAV 65]. 


The approach to adopt here is summed up below: 
bk i : __ s(co)—s(07), 
1) determining the static gain of the procedure K = = ~; 
si—s(0 7). 
s(co)—s(0-)? 
3) rounding s;% to one of the values given in Table 7.2. Hence, we set the order n; 


2) tracing the tangent to the point of inflexion J. Calculating s;% = 


4) on the curve, obtaining value Tn. Inferring Tu by using the column Tu /Tn; 


5) calculating T, = tı — Tu. If T, is negative (or very small in front of Tyu, Th), 
we can readjust the tangent at the inflexion point in order to obtain T, = 0; 


6) calculating the time constant 7 by using one of the last three columns in Table 
7.2. 
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NOTE 7.1. The calculation of 7 can be done by using more than the last three columns. 
If dispersion is too significant, the position of the inflexion point must be modified, as 
well as the tangent. In practice, we can see that a slight modification of the tangent 
can lead to significant variations of parameters T;.,7,. This is explained by the fact 
that different triplets (T,., T, n) can lead to similar forms of the unit-step response. 



































n Si% Tu/Tn Tn/T Tu/T t,/T 
1 0 0 1 0 0 
2 0.26 0.104 27 0.28 1 
3 0.32 0.22 ex 0.8 2 
4 0.35 0.32 4.46 1.42 3 
5 0.37 0.41 3:12 2.1 4 
6 0.38 0.49 5.7 2.81 5 
7 0.39 0.57 6.2 3.55 6 
8 0.40 0.64 6.7 4.31 7 
9 0.407 0.71 7.2 5.08 8 

10 0.413 0.77 7.7 5.87 9 


























Table 7.2. Strejc method 


7.3.2.4. Delayed 1" order model 


We suppose that the unit-step response of the system considered can be approached 
by that of the delayed 1** order system as illustrated in Figure 7.10. The transfer 
function selected is: 


K e™+r?P 
PO) = Tap 


The delay T, can be graphically adjusted and the time constant 7 can be obtained 
by one of the methods described in section 7.3.2.1. In order to determine the numeric 
values of T, and 7, the Broïda method uses two particular points s(t,) and s(t2) 
obtained on the experiment curve. 


Determining the transfer function F(p): Broïda method 
The method is divided up as follows: 


ae i ; __ s(co)—s(0 ), 
1) determining the static gain of procedure K = = ~; 


2) obtaining tı, t2 such that: 


[7.47] 





He = (07) + 0.28{s(00) — s(0-)} 
s(t2) = s(07) + 0.4{s(00) = s(07)} 
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3) calculating the time constants of the model: 


[7.48] 


T=55 (t2 = tı) 
T, = 2.8 t1 — 1.8 t2 





Unit-step input e(t)/ 
response of procedure s(t) 











Figure 7.10. Unit-step response, use of Broïda model 


NOTE 7.2. The unit-step response of the model selected is: 


t+Tr 


sm(t) = s(07)+ KAE (1 -e~ 7 )) for t>0 





By using [7.47], we have: 





ti=Tr ) 


S= . =0.28 = 1 — e- 


Si = 


or: 


Te Sini — 0.28) = In(0.72) 
L [7.49] 
2: Lis 


T 


from which we extract he = ee Expressions [7.48] previously given are then 
easily established. 





7.3.3. Partial conclusion 


A graphic approach enables us to easily obtain a continuous transfer estimation. 
However, this facility is limited to predefined structured models associated to the 
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recording of a specific response. We need to be aware of the fact that it is essential 
to have a little, even very little, noise on the responses and, irrespective of everything, 
the precision of the result can be uncertain. To go beyond the constraints, only a purely 
numeric approach is likely to bring a positive response. This topic will be now dealt 
with. 


7.4. Identification through criterion optimization 


A first revolution, at least psychological, was to admit that we can more success- 
fully apprehend the research of behavioral models of a process, on a continuous-time 
scale by definition, by dealing with z transfer functions for which the concept of time 
unity disappeared, after a carefully chosen input-output sampling. There were at least 
two reasons for this approach: on the one hand, the problem of identification (and 
also the problem of simulation and control) is made much easier by the computing 
tool and is already well known in data analysis (linear or non-linear regression) and 
on the other hand, since the object of the behavioral model was, a priori, the syn- 
thesis of a computer control, the procedure was subjected to a “per piece constants” 
type inputs, for which the discrete model was ideally adapted (no loss of information 
during sampling and thus no approximation during the passage from continuous-time 
scale to discrete-time scale). However, this does not mean that an approach aiming 
to deal directly with the identification of a continuous model must be mocked, but we 
just need to know where not to go. To return to the initial objective of modeling, in our 
opinion it is necessary to have good reasons to give up the “model method” approach 
which consists of finding a set of parameters, for example those of a transfer, so that 
the output of the model “fits best” to the output of the procedure for the same input 
excitation. This obviously means, in the majority of cases, a non-linear optimization. 
However, the majority of control engineers dealing with identification have often kept 
ignoring this approach in order to avoid having to deal with this numeric problem, 
even if it has been well controlled for more than 20 years (see the IMSL, TOMS, 
HARWELL, NAG, etc., databases). 


74.1. Algorithms 


Generally speaking, the quasi-totality of algorithms is built on basic principles of 
the linear parametric estimation whose dynamic version — that constitutes Kalman’s 
filter [RAD 70] — represents a superior contribution to this field. The fundamental 
alternative is to use the techniques of mathematical programming — in particular at the 
level of the non-linear optimization [GIL 81]. Because of the delicate status of these 
problems, their implementation required many studies and research before robust solu- 
tions were obtained. Hundreds of publications dealt with the algorithm developments, 
the majority of which focused on the conceptual aspect of the problem. This ten- 
dency has decreased significantly since the beginning of the 1980s, and gave way to 
an increase in studies on digital and computer implementation. Before approaching 


Process Modeling 215 


this question under a more pragmatic angle, we will see what we can use from the 
multitude of approaches proposed in other works [LAR 77, LJU 87]. Hence, we will 
distinguish several categories of problems and classes of methods; however, for rea- 
sons of simplicity, we will limit ourselves to the stationary context (unknown param- 
eters constant in time) which, in reality, covers many industrial applications. 


7.4.2. Models 


7.4.2.1. Single output system 


For each input, the model is a transfer H(z), or its original h;, the weighting 
sequence, i.e. the discrete impulse response. Let us recall that in this case the relation 
between an input u+ and an output y, is given by: 


Ye = —(Q1ye-1 +++ + anYt—-n) + (bour-r + +++ + bmUut-k-m) [7.50] 


for the transfer: À 
27" (bo tes: +bmz—™) = B(2) 

H(z) = = 7.51 
(2) Ttartta" A(z) [ ] 
a; and b; are the parameters of the model. The integers k, m and n represent the 
delay (expressed in sampling intervals), the number of zeros and the number of poles 
respectively. In such a context, we can also impose that all transfers related to the 
various inputs have the same denominator. With certain methods, the user does not 


have a choice (all algorithms derived from least squares!). 





7.4.2.2. Multivariable system 


Here, the system fundamentally has several outputs and one or several inputs. In 
this case, two attitudes are possible: we either call upon a series of single output sys- 
tems, and this is generally not optimal at the level of the model’s complexity (number 
of states), or we deal with the problems directly as they stand, by using only the state 
representation: 


[7.52] 


L141 = Azt + Bu 
y = Cay 


where uz is the vector input, y, the vector output and x; the state vector. The param- 
eters to estimate are matrices A, B and C and possibly the initial state x9. Here, the 
over-parameterization represented by this approach imposes the research of so-called 
specific canonical forms. 


7.4.3. Methods 


7.4.3.1. Operation in a stochastic context 


This means that we try to identify at the same time, a model of the procedure and 
a model of the ambient noises or interferences. This second model is reduced to a 
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so-called “shaper” filter whose input, imaginary signal, is labeled as an independent 
sequence. Therefore, it is traditional to use what we call an ARMAX structure which 
consists of writing an input-output relation including the imaginary signal, called here 
innovation and noted by 1: 


Yt = — (Gye He? + On Yeon) + (Bou + + bmUt—-k-m) 
+ Ve + C1 +: + Cpr [7.53] 


Parameter r is the degree of the “shaper filter”. 


This approach usually uses “maximum likelihood” type methods, corresponding 
to the hypothesis according to which 14 is an independent Gaussian sequence, and 
“extended least squares”, which represents a version which can be operated in real- 
time. The “instrumental variable” type algorithms or “generalized least squares” are 
alternatives in which the shaper filter becomes, for example, C'(z)/D(<), instead of 
C(z)/A(z) previously. We note C(z) = 1+ c127t +--+ +¢,27" and D(z) another 
polynomial not formulated here. In all cases, the zeros and the poles of the shaper 
filter must be asymptotically stable. 


7.4.3.2. Operation in a deterministic context 
This time, the only objective is determining a model of the procedure. Without 
getting into details, we can consider that there are two basic structures, depending on 
whether we deal with an output error £+ (method of the model whose output is 7): 
m = — (a1 +: + anMi-n) + (boutr +: + bmUut-k-m) [7.54a] 
Et = Tht — Yt [7.54b] 


or an equation or prediction error e+ (“least squares” method): 
Yt = — (ar Ye 1 ST Cie D AnYt-n) a (bout—k afte le bmt k-m) + et [7.55] 


We note that formalizing an output error [7.54] or a prediction error [7.55] is only 
a particular case of what we call innovation in [7.53]. In the first case, it is enough 
to assume that r = n and a; = c;, i = 1,...,n; in the other case, we simply have 
r = 0. The fact that we speak of deterministic context does not mean that we cannot 
assimilate the interferences and/or measurement noises to random variables, contrary 
to certain interpretations. 


7.4.4. Optimization criteria 


7.4.4.1. Quadratic optimization criterion 


The formulation of the identification algorithm is expressed, from the point of view 
of calculation, in terms of linear regression (or linearized), i.e. it leads to a quadratic 
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minimization problem (with respect to parameters), more often without constraints. 
In this case, we can carry out either a general processing of all data at the same time 
(offline operation), or a sequential processing (with respect to time) of the inputs- 
outputs enabling a real-time implementation (online). We often find in works on this 
subject the name “recursive” for designating this type of approach. This term seems 
totally inadequate if no recursion appears, in the mathematical sense of the word! By 
supposing that 0 = [a1,...,ah:b0,...,b|, such a criterion is, for example, that of 
simple least squares: 


J(0) = Sve [7.56] 
t 


NOTE 7.3. With this type of criterion, the estimation of parameters is often biased! 
In fact, for the estimation not to be biased, the sequence e; must be available to an 
independent random sequence, at the optimum which is rarely the case. 


7.4.4.2. Non-linear optimization criterion 


Unlike the previous case, we consider here the hypothesis where the identification 
procedure leads to a non-linear regression. This time, the numeric procedure is then 
necessarily iterative (non-linear programming (NLP) algorithm) and thus implies an 
offline processing. The model method [RIC 71] consists of minimizing the criterion: 


J(0)= Soe [7.57] 
t 
whereas the maximum likelihood method operates with: 
J= x [7.58] 
t 
where ĝ = (0,c1,..-,¢,]. Only the solutions approached for this problem can have 


a sequential aspect. This is the case of the extended least square method with the 
criterion [7.58]. Generally, these approaches are fundamentally unbiased. 


7.4.5. The problem of precision 


Let us note by Ô the set of parameters obtained by minimizing one of the previous 
criteria [7.56] to [7.58]. A basic question is to know what degree of confidence we 
can give to this result. It is necessary to make statistic hypotheses on, for example, 
the type of interferences focused by ez, € or v+ for talking of standard deviation for 
the parameters contained in 6. However, in a context of industrial application, we can 
perfectly evaluate the precision of 6 without resorting to any stochastic reasoning. The 
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following expression, obtained from Fisher’s information theory and in relation to Rao 
Cramer’s inequalities, can be used without resorting to sophisticated theories: 





is 5 1 
o7(0) = var(6) = diag 76 = | i 4 | [7.59] 
N — dim(0) L06,06; 

Here, N represents the horizon length of the summation with respect to t in the 
criteria. The uncertainty on each component 6; of 6 is then quantified by y/(var(ĝ;)). 
The interpretation of this value then depends of course on reasonable hypotheses that 
can be associated to the context. For example, if e+, €; or v; (depending on the method 
used) can be assimilated to a Gaussian independent random variable, then 6; is an 


unbiased estimator of standard deviation \/(var(@;)). If, on the other hand, no such 
hypothesis is made, \/(var(0;)) does not necessarily represent a standard deviation 
any longer, but remains an indirect indication of the uncertainty on 6. To understand 
why this argument is justified, it is sufficient to reason in a parametric space of size 
1 instead of n + m + 1(+r). In [7.59], let us ignore for a second the reverse of the 
second derivatives matrix of the criterion, which are optimally evaluated. The term 
J(8)/(N — dim(6)) represents the root mean square of the prediction error, of the 
output error or of the innovation. If we divided it by the root mean square of the mea- 
sured output y+, we would have an image of the signal-to-noise ratio. In other words, 
the “precision” of 6, expressed via [7.59], will improve proportionally to the S/N 
ratio, which is perfectly logical. The interpretation of the second term is more subtle. 
The altitude of J minimum is in relation with the S/N ratio. If the function to mini- 
mize J presents a minimum with a flat base, the exact position of this minimum could 
considerably vary, for the same level of noise or interferences, from a data recording 
to another. In other words, the value of Ê could considerably change from one attempt 
to another. Hence, for a calculation done with a given recording, the precision of 6 
will be very low. Let us go back to formula [7.59]. A flat base corresponds to a sec- 
ond small derivative (in absolute value); its reverse is thus high, and so var (ô) is high. 
Conversely, when the criterion presents a sharp base, the position of the minimum will 
vary very little from one recording to another and the calculation obtained with any 
recording will provide an Ê which we can trust. In this case, the second derivative is 
high, its reverse is small, and var (ô) is thus small, translating a low uncertainty. In a 
multidimensional context, we can add to the previous theory the impact of the Hessian 
conditioning that increases along with an over-parameterized model and/or an input 
which is dynamically too poor. This is translated into an increase of the standard of 
the reverse — from which again a big uncertainty is obtained as expected! 


7.4.6. How to optimize 


The minimization of criteria is the milestone of identification algorithms. The 
absence of good numeric methods means that there are very few chances to reach 
a result that is certain. There are two fundamentally distinct situations to consider. 
If the criterion is quadratic, we have to use orthogonal factorization methods, be it a 
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general, offline or sequential processing [LAW 77]. It is the only way to guarantee 
that the matrices that must be symmetric and positive definite in theory are so numeri- 
cally — irrespective of the precision of calculation (simple or double). For a non-linear 
criterion, only iterative methods can solve the problem. Here the situation is less clear, 
due to the difficulty of the problem and to the multitude of possible approaches. The 
criteria to minimize are characterized by a reduced number of parameters to estimate, 
from a few units to a few dozen, their “square sum” type structure and a very high 
conditioning factor which can reach 101, Geometrically, this is translated into con- 
tour lines extremely extended in certain directions and very dense in others: mathe- 
matically, the conditioning is defined by: 
laa 06; 06; ra 06; | 


These characteristics exclude any heuristic approach. For example, using Nelder- 
Mead simplex method is absurd in this situation. Likewise, it is equally inappropriate 
to use a method of conjugate gradients (reserved for big problems of modest condi- 
tioning) and even less appropriate to use, the method of the so-called “optimal” gra- 
dient. This last choice is the worst because it fails almost systematically. Algorithms 
like those of Levenberg and Marquart, usually related to “square sums” type crite- 
ria, are efficient in the 1963 initial description only if the minimum is close to zero! 
Moreover, they are not always robust in the case of high conditioning, even if they 
have been highly improved [MOR 77]. So we are left with “quasi-Newton” type algo- 
rithms. Only the factorized Hessian implementations are capable of being numerically 
stable in the case of high conditioning [POW 75]. 











7.4.7. Partial conclusion 


The transfer approach is by far the most practical. The identification of a state 
model is of a complexity order of magnitude superior to the previous approach, due 
to the a priori over-parameterization that is, triplet (A, B,C). It is the problem of 
“canonical forms” that we have here, with the immediate incidence on the numer- 
ically undecidable character of algebraic conditions (all or nothing) of matrix rank 
(transforms of approximately projective type) [BAR 77]. Hence, no totally automatic 
procedure is surely satisfactory. Algorithms must remain guided (interactively) by the 
user at the level of the canonical structure. This is one of the arguments that encour- 
ages us to orient ourselves towards other approaches involved by questions of numeric 
conditioning (stability and robustness of calculations) without any reference to any 
canonical form. Passing through an impulse response is not of any practical interest 
nowadays, if this type of model does not lead to anything (so to speak) in terms of 
control. To conclude, we need to underline that the identification of a procedure is 
related to open loop and that, fundamentally, everything that has been said above sup- 
poses that the input does not depend on the output. Similarly, the models we referred 
to express a dynamic input-output relation, in variation around zero. 
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7.4.8. Practical application 


Implementing a behavioral (identification) model requires a knowledge which 
works on three levels: 


— conducting an ad hoc test campaign; 
— determining the proper structure of the model and estimating its parameters; 


— validating this modeling in relation to the final objective of the design of such a 
model. 


7.4.8.1. Identification protocols 


When a test campaign must be defined, a first task consists of sufficiently under- 
standing the physical operating mode of the procedure to be modeled, in order to cor- 
rectly define the magnitudes that will have the role of inputs — real or non-real inputs, 
but manipulable (regulator set points), non-manipulable inputs (measurable interfer- 
ences) — and of outputs. Then, since the procedures are a priori non-linear, several test 
campaigns could be implemented around various characteristic working points. The 
actions can be of the “slot” or “symmetric step functions” type. Being able to record 
parts of unit-step responses is always a very interesting information with respect to a 
priori knowledge (sign and magnitude order of static gain, delay evolution, presence 
of a non-minimum phase, etc.). As soon as the sinusoidal input is eliminated (to be 
kept in mind in order to verify the linearity of the procedure within a given variation 
range, for example), and as soon as a certain independence is guaranteed between the 
various inputs, we underline that it is not necessary to introduce other constraints than 
that which ensures a functioning sufficiently rich in information on the dynamics of 
tested procedure. In particular, it is not necessary to impose a test campaign controlled 
by a computer — for example, in order to obtain a pseudo-random binary (or ternary) 
sequence type of input excitation. The argument that it is easy in these conditions to 
obtain by correlation an impulse response model does not have a practical value for an 
industrial procedure. It is also true that these inputs are excellent, even the best, and 
if the identification by correlation is not advisable, “carrying out” an autocorrelation 
on the inputs and inter-correlation for inputs and outputs is an efficient way to per- 
form a concentration of initial data by replacing the real input-output pair by the input 
autocorrelation and input-output inter-correlation pair. An indispensable approach is 
to operate in closed loop. In this case, we replace the real input-output pairs by the 
input-set point and output-set point inter-correlation pairs. The set point, the external 
input of the looped system, can very well be again a binary sequence for its dynamic 
qualities, but this is still not indispensable, since the particularity of its autocorrelation 
function is no longer used. 


7.4.8.2. Identification tools 


A few years back, due to the multiplicity of solutions proposed in works on the 
subject, we could think that the difficulty is to know which algorithm to choose. The 
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problem was to encode the method selected, because of the shortage of “handy” avail- 
able solutions. Today the situation is no longer the same. There are ergonomic envi- 
ronments offering a reduced but realistic choice of possible alternatives: see [MAT 
95], which is only one of the universally known environments. Except for a specialist 
in the identification field and digital techniques, we strongly advise everyone not to 
initiate the design of an identification tool whose first vocation would be the modeling 
of industrial procedures. At the level of methods, a main choice concerns the alterna- 
tive between “prediction error” type approaches and “output error” type approaches, 
the latter being preferred by us. A second criterion refers to the available support for 
determining structural parameters such as k,m and n (or their equivalent including 
multivariable systems). Finally, it is very important to have validation means — for 
example, a very important one would be the accuracy obtained for the parameters. 
Poles, zeros, typical responses and residuals are a valuable amount of information that 
can be used by an identification software. Even if identification was presented here 
based on time information generated by the procedure, analyzing the scalar properties 
of a model can be a significant piece of information as well. Finally, the validation 
of other input-output data is easily done today by comparing, for example, the new 
residuals ez, €; or v4 With respect to those obtained by an optimization with the initial 
data, both in terms of root mean square and autocorrelation. 


7.4.8.3. “Customizing” 


As for the unconditional aspects of the “home” software, below there are a few 
recommendations on what should be done and what should not be done: 


— never encode the following type of formulae: 
P, = P — Po(1+v' Pv) tut P [7.60] 


where P is assumed to be symmetric defined positive and where v represents a column 
vector. From a mathematical point of view, P, is defined as positive if P is. Numeri- 
cally, it is not necessarily the case. Thus, for example, negative variances in Kalman’s 
filter, the sequential least squares of any type, etc.; 


— for gradient calculations, never perform lateral finite differences, but central 
ones, in the absence of the basically incontrovertible analytical form. In fact, the lateral 
finite difference cannot “tend” towards zero when the current point tends towards a 
minimum; 

— never use “gradient” type non-linear optimization methods, including the con- 
jugated gradient; 

— use factorized “quasi-Newton” type methods which are incomparably more 
robust than Levenberg-Marquant’s original algorithm, even if this is the reference in 
the “identification” environment; 


— in the calculation of a criterion whose theoretical value for the current set of 
parameters diverges (for example, a transfer output error whose poles are, during an 
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iteration, unstable), interrupt the simulation and the calculation of sensitivity func- 
tions, hence of the gradient, on a preset upper bound. In the contrary case, there is a 
fatal error risk in the optimization solver (irrespective of which one it is!); 

— use the factorized forms for all the formulae for the update of symmetric posi- 
tive defined matrix. For example, the equivalent of [7.60] is: 
z g 
0 Z 


1 0-0 
Yv Y 


QM =Q [7.61] 














where Y is Cholesky’s factorization of P; hence, by construction, Z is that of P}. 
In this approach, Q is an orthogonal matrix (Householder transformations or Givens 
rotations) calculated to perform the superior triangular shaping of M. Quantity g/z 
corresponds to the gain to be applied to the prediction error in the least squares for- 
mula. 


7.5. Conclusion around an example 


To conclude this brief overview of parametric identification, here is a test prob- 
lem with which it is easy to practice while using the various methods of identification 
previously covered. The following results were obtained with the Matlab [LJU 95] 
“identification” tool box, but any other software implementing the appropriate calcu- 
lations can be used. 


7.5.1. Simulated procedure 


It is totally artificial and easily reproducible. It is characterized by a transfer in z 
presenting the following characteristics: 


— a pure delay of two sampling intervals; 


—anon-minimum phase, with a zero of value 1.1; 





— a pair of conjugated complex number poles highly oscillating (0.9 + 0.32); 
— a faster mode (0.8); 


— a Static gain set at 5. 


The working point of this procedure is, for the input, around level 50 and, for 
the output, around level 400. The test protocol is reduced to two unbalanced slots of 
different lengths — so that after their application, the procedure is around the same 
working point. A 20 standard deviation independent Gaussian noise was added on 
the calculated output. The simulation results appear in Figure 7.11 for an 81 second 
simulation period and a sampling interval te = 0.25 second, i.e. 325 pairs (u+, y+). 
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Figure 7.11. Simulated procedure 


The reasons for these choices can be used as starting points for the “manual” con- 
struction of test campaigns. The length of the first slot enables us to “see” the unit-step 
response of the system. The short period of a second is there to tell us that it is not 
necessary to see it for a numeric approach (opposed to a graphic one) in order to ben- 
efit from the information present in the input-output data. Finally, the last one is built 
in such a way that the system returns to its initial working point. This makes it pos- 
sible to visualize a drift and to evaluate it if necessary. In addition, the sequence of 
the two ascending and descending slots makes it possible to appreciate the linearity 
hypothesis implicitly made. Moreover, the presence of vertical fronts offers the pos- 
sibility of easily evaluating an initial value of delay k. For a multi-input context, the 
same type of approach can be implemented, by paying attention to make these level 
changes at different instants for each input, so that, globally, the inputs are ‘“inde- 
pendent”. Obviously nothing prevents subcontracting to a program the generation of 
statistically independent pseudo-random binary sequences. Whoever can do a lot can 
also do a little, but again, this is not indispensable. 


7.5.2. In search of a model 


Due to space constraints here, we cannot present all the characteristic results of 
tests that progressively lead to a “good model”. Briefly, upon the selection of a method, 
we can say in general that a noise analysis on the measurements enables us to choose 
between the method of the model (if the noise seems “sufficiently white”) and the 
maximum likelihood (in the contrary case). This reasoning ensures no bias for any 
coherent experimental condition. The real difficulty is to rapidly estimate the structure 
parameters k, m and n of the transfer. Generally, it is better to start with low complex- 
ity n = 1 or 2 and increase n according to the results. Similarly, for m, the values 
0 and 1 are rarely exceeded. The initial estimation of k is done visually — hence the 
interest in the inputs with at least one isolated rigid front. 
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Figure 7.12. Method of the model 
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Figure 7.13. Method of simple least squares 
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Figure 7.14. Place of poles for a too complex model 


In order to adapt parameters k, m and n, we have several indicators: 
—as long as o (an) > |a,| = decrease n with 1; 
— as long as o (bm) > |bm| =, decrease m with 1; 
— as long as o (bo) > |bo| =, increase k with 1; 
— if Vi: |pole;| < 1 and Re(pole;) < 0 decrease n with 1; 
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— if the output error reaches a level inferior to the noise level, then there is over- 
parameterization. 


The first three arguments express that maybe the parameters in question are zero. 
The following one indicates that a fast oscillating pole, which cannot be the result of 
a correctly sampled continuous pole, models the measurement noise rather than the 
procedure. In other words, we have a too complicated model which adjusts the data 
so well that the noise present in the measurements is no longer filtered. Finally, the 
fifth procedure is based on the same approach. It is equally appropriate to examine the 
autocorrelation (or the spectrum) of the minimized error which must be sufficiently 
characteristic of an independent sequence, with respect to the hypotheses justifying 
the choice of the method and model. Numerous other details coming from experience 
are to be considered and their synthesis would be difficult here. Anyone interested 
may refer to the works previously mentioned. 


To conclude, below a few characteristic results for a good structure are listed, i.e. 
k = 2,m = 1,n = 3. Through the model method (see Figure 7.12), we have obtained 
the results in Table 7.3. 
































Parameter Gain Zero Pole: 2 Poleg 
Value 4.906 1.0967 8.93e—1 + 2.96e—1 à 7.73e—1 
Standard deviation 7.3e—2 2.3e—3 1.7e—3 8.1e—3 
Parameter bo by ay az a3 

Value —1.141 1.251 —2.560 2.266 —6.843e— 1 
Standard deviation 3.6e—2 3.8e—2 9.6e—3 1.8e—2 8.9e—3 


























Table 7.3. The model method 


For comparison, the least squares method leads to Figure 7.13, which clearly 
underlines the significant bias present in the estimation of model parameters. The 
parameters are those in Table 7.4. 



































Parameter Gain Zero Pole: ,2 Poles 
Value 4.11 1.38 9.18e—1 + 2.23e—1 i 4.72e—1 
Standard deviation 5.2e—1 5.2e—1 1.19e—2 5.0e—2 
Parameter bo by ay ag a3 
Value —8.93e—1 1.23 —1.36 2.50e—2 4.22e—1 
Standard deviation 2.4e—1 2.4e—1 4.9e—2 8.9e—2 4.9e—2 


























Table 7.4. Least squares method 


Even if the standard deviations are not so good, this degradation does not make it 
possible to explain the non-adequacy of the model. Only the residual analysis makes 
it possible to explain the strongly biased character of the least squares method. 
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Finally, if we test a transfer with n = 5, two poles in the left unit semi-circle 
appear (see Figure 7.14), which are indicators of a not very complex model. These 
results were obtained with the model method. 
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Chapter 8 


Simulation and Implementation 
of Continuous Time Loops 


8.1. Introduction 


This chapter deals with ordinary differential equations, as opposed to partial deriv- 
ative equations. Among the various possible problems, we will consider exclusively 
the situations with given initial conditions. In practice, the other situations — fixed final 
and/or intermediary conditions — can always be solved by a sequence of problems 
with initial conditions that we try to, by optimization, determine so that the other 
conditions are satisfied. Similarly, we will limit ourselves to 1% order systems (using 
only first order derivatives) as long as in practice we can always obtain such a system 
by increasing the number of equations. 


We will study successively the linear and non-linear cases. Even though the lin- 
ear case has by definition explicit solutions, the passage from formal expression to a 
virtual reality, with the objective of simulating, is not so trivial. On the other hand, in 
automatic control, Lyapunov or Sylvester’s matrix equations, even if also linear, can- 
not be processed immediately, due to a prohibitive calculating time. For the non-linear 
case we will analyze the explicit approaches — which remain the most competitive for 
the systems whose dynamics remain of the same order of magnitude — and then we 
will finish by presenting a few explicit diagrams mainly addressing systems whose 
dynamics can significantly vary. 
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8.1.1. About linear equations 


The specific techniques of linear differential equations are fundamentally exact 
integration diagrams, provided that the excitation signals are constant between two 
sampling instants. The only restrictions of the integration interval thus remain exclu- 
sively related to the sensitivity of underlying numerical calculations. In fact, irrespec- 
tive of this integration interval, theoretically we have to obtain an exact value of the 
trajectory sought. In practice, this can be very different, irrespective of the precision 
of the machine, as soon as it is completed. 


8.1.2. About non-linear equations 


Inversely, in the non-linear case, the integration numerical diagrams can essentially 
generate only one approximation of the exact trajectory, as small as the integration 
interval, within the precision limits of the machine (mathematically, it cannot tend 
towards 0 here). On the other hand, we can, in theory, build integration diagrams of 
increasing precision, for a fixed integration interval, but whose sensitivity increases so 
fast that it makes their implementation almost impossible. 


It is with respect to this apparent contradiction that we will try to orient the reader 
towards algorithms likely to best respond to the requirements of speed and accuracy 
accessible in simulation. 


8.2. Standard linear equations 
8.2.1. Definition of the problem 


We will adopt the notations usually used to describe the state forms and linear 
dynamic systems. Hence, let us take the system: 


X (t) = AX(t) + BU(t) [8.1] 


Matrices À, B and C are constant and verify A € R”*”, B € R°*7. As for X 
and U, their size is given by X € R"*™ and U € R™*". To establish the solution of 
these equations, we examine the free state, and then the forced state with zero initial 
conditions. For a free state, we have: 


X(t) = eA") X (to) 


and for a forced state, with X (to) = 0: 


X(t) = i | eAG-T) BU(r) dr 


to 
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In the end we obtain: 


t 
X(t) =e) X (to) + f eAË-T) BU (r) dr [8.2] 


to 


8.2.2. Solving principle 


Based on this well known result, the question is to simulate signal X(t). This 
objective implies an a priori sampling interval, at least with respect to the storage 
of the calculation result of this signal. In the linear context, the integration will be 
done with this same sampling interval noted by h. In reference to the context of usual 
application of this type of question, it is quite natural to assume that the excitation 
signal U (t) is constant between two sampling instant. More exactly, we admit that: 


U(t) =U(kh), Vt € [kh, (k+ 1h] [8.3] 


If this hypothesis was not verified, the next results — instead of being formally 
exact — would represent an approximation dependent on h, a phenomenon that is found 
by definition in the non-linear case. Henceforth, we will have Xy = X (kh) and the 
same for U (t). From equation [8.2], by supposing that to = kh and t = (k + 1)h, we 
obtain: 


(k+1)h 
Nga = e^" Xp + | / elt D2—7] dr! BU, [8.4] 
kh 
This recurrence can be written as: 
Xk+1 = OX, + TU; [8.5] 


By doing the necessary changes of variables, the integral defining I is considerably 
simplified to give along with ® the two basic relations: 


P = e^h 
8.6 
T= fe eATB dr Be] 


8.2.3. Practical implementation 


It is fundamental not to try to develop lin any way. In particular, it is particu- 
larly inadvisable to want to formulate the integral when A is regular. In fact, in this 
particular case, it is easy to obtain T = A~![® — 7]B = [@ — I] A~'B. These for- 
mulae cannot be an initial point for an algorithm, insofar as [ could be marred by a 
calculation error, which is even more significant if matrix A is poorly conditioned. An 
elegant and robust solution consists of obtaining simultaneously ® and I through the 


relation: 
el -a(g à n) 187] 
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The sizes of blocks 0 and I are such that the partitioned matrices are of size (m+n) x 


(m + n). This result is obtained by considering the differential system w= MW, 
W (0) = J, with: 


[8.8] 


4 


0 0 


and by calculating the explicit solution W (h), via the results covered at the beginning 
of this section. 


There are two points left to be examined: determining the sampling interval h and 
the calculation of ® and I’. The calculation of the matrix exponential function remains 
an open problem in the general context. What we mean is that, irrespective of the 
algorithm — as sophisticated as it is — we can always find a matrix whose exponential 
function will be marred by a randomly big error. On the other hand, in the context 
of simulation, the presence of sampling interval represents a degree of freedom that 
makes it possible to obtain a solution, almost with the precision of the machine, reach- 
ing the choice of the proper algorithm. The best approach, and at the same time the 
fastest if it is well coded, consists of using Padé approximants. The choice of h and 
the calculation of & and T are then closely linked. The optimal interval is given by: 


i ae 
0 0 

This approach does not suppose in practice any constraint, even if another signal 
storage interval were imposed. In fact, if this storage interval is bigger, we integrate 
with the interval given by [8.9] and we sub-sample by interpolating if necessary. If, on 
the contrary, it is smaller, we decrease h to return to the storage interval. The explana- 
tion of this approach lies on the fact that formula [8.9] represents an upper bound for 
the numerical stability of the calculation of exponential function [8.7]. Since the value 
of the interval is now known, we have to determine order q of the approximant which 
will guarantee the accuracy of the machine to the result of the exponential function. 
This is obtained very easily via the condition: 


h = max 2’: 
1EZ 


< 1 [8.9] 

















= mini: |MAl te <6, es41 = “3 =2 [8.10 
q = mini: ||MhI| ei LE, ej+1 10; + DOG; +3) el = 3 [8.10] 


where M is given by [8.8] and e€ is the accuracy of the machine. 


NOTE 8.1. For a machine of IEEE standard (all PCs, for example), we have q < 8 
double precision. Similarly, if || Mh|| < À, q = 6 guarantees 16 decimals. 


Let us return to equation [8.7] and we shall write it as follows: 


N=e™h 
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Let N be the estimated value of N; then, N is obtained by solving the following 
linear system, whose conditioning is always close to 1: 


Pq(—Mh)N = pq(Mh) [8.11] 


where p,(x) is the q degree polynomial defined by: 


q | q tek 
pate) = Das, a=] = [8.12] 
i i=1 | 


In short, the integration of [8.1] is done from [8.5]. The calculation of ® and T is 
obtained via the estimation N of N. Finally, the calculation of N goes through that of 
the upper bound of sampling interval [8.9], the determination of the order of the Padé 
approximant [8.10], the evaluation of the corresponding polynomial [8.12] and finally 
the solving of the linear system [8.11]. 


NOTE 8.2. We can easily increase the value of the upper bound of sampling interval if 
BI > || Al]. It is enough to standardize controls U(t) in order to have ||B|| < ||A||. 
Once this operation is done, we can again improve the situation by changing M in 
M — pl, with u = tr(M)/(n + m). We have in fact |M — 1} < ||M]||. The initial 
exponential function is obtained via N = e@e(M@—# Dh, 


NOTE 8.3. From a practical point of view, it is not necessary to build matrix M in 
order to create the set of calculation stages. This point will be explored — in a more 
general context — a little later (see section 8.3.3). We can finally choose the matrix 
standard Lı or Loo, which is trivial to evaluate. 


8.3. Specific linear equations 
8.3.1. Definition of the problem 


We will now study Sylvester differential equations whose particular case is rep- 
resented by Lyapunov differential equations. These are again linear differential equa- 
tions, but whose structure imposes in practice a specific approach without which they 
basically remain unsolvable, except in the academic samples. These equations are 
written: 


Le} 


X (t) = AX(t)+X(t)A2+D, X(0)=C [8.13] 


The usual procedure here is to assume tọ = 0, which does not reduce in any way 
the generality of the statement. The size of matrices is specified by Ay € R™*™, 
A> € R™*"™ and X, D,C € R™”*"2. It is clear that based on [8.13], the equation 
remains linear. However, the structure of the unknown factor does not enable us to 
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directly apply the results of the previous section. From a theoretical point of view, we 
can, however, return by transforming [8.13] into a system directly similar to [8.1], via 
Kronecker’s product, but of a size which is not usable for the majority of the time 
(ning X nına). To set the orders of magnitudes, we suppose that ny = ng. The 
memory cost of such an approach is then in n* and the calculation cost in nê. It is 
clear that we must approach the solution of this problem differently. A first method 
consists of noting that: 


X(t)=eht(C-E)eAt+E [8.14] 


verifies [8.13], if E is the solution of Sylvester algebraic equation A1 E + E A3 + 
D = 0. Two comments should be noted here. The first is that we shifted the difficulty 
without actually solving it — because we must calculate Æ, which is not necessarily 
trivial. Secondly, the non-singularity of this equation imposes constraints on A, and 
A2 which are not necessary in order to be able to solve the differential equation [8.13]. 


8.3.2. Solving principle 


A second richer method consists of seeing that: 
t 
X(t) = f e“17(A\C — CAg + D)e*?7 dr +C [8.15] 
0 


is also solution of [8.13], without any restriction on the problem data. Now we will 
examine how to calculate this integral by using the techniques specified for the linear 
standard case. For this, we have: 


Carne ee sue 
Thus, we have: 
Y(t) = VHW (t) [8.17] 
with: 
a(o A ) = Ki A] ~ S(t) [8.18] 


It is clear that S(t) is the solution of the standard linear differential equation: 


ale 20) . E ! A ee Gale 5(0) =1 
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However, by formulating it, we have: 


V= AV" VOySI 
W=-AW+QZ, W(0)=0 
J=AgZ, Z(0)=I 


which thus gives: 


V(t) = et 
W(t) = fo e0- Qe427 dr [8.19] 
Z(t) = eAzt 


From [8.19], we have: 
W(t) =e"“*Y (t) = V (t)Y (t) 


which leads to the announced result [8.17]. The solution X (t), to the initial condition, 
is identified with Y (t), because we have X(t) = Y (t) + C. 


The particular case of Lyapunov equations represents a privileged situation, as long 
as the inversion of V(t) disappears. In fact, when we have: 


A2 = AT =A [8.20] 


there is: 


from where: 
Y(t) = Z7 (t)W(t) [8.21] 


8.3.3. Practical implementation 


Again, everything lies on a calculation of matrix exponential function. Let us sup- 
pose again that: 


_|-Ar Q 
m= | A [8.22] 


The argument previously developed for the choice of integration interval is applied 
without change in this new context, including the techniques mentioned in Note 8.2. 
However, we have to note that, in the case of Lyapunov equations, we necessarily have 


234 Analysis and Control of Linear Systems 


u = 0. Since the integration interval is fixed, the order of the Padé approximant is still 
given by [8.10]. In practice, it is useful to examine how we can calculate the matrix 
polynomials [8.11]. Hence: 


pa( Mh) = Be | 


No 
Dı Dy 
—Mh) = 
Pal ) | 0 ] 


We have the approximation of S(h) [8.18]: 


5 =|" ale Nae we) 


By developing: 


V(h) = e^” ~ DIN = 
W(h) ~ DI (Ni — Di2 D3 ')N2 [8.23] 
Z(h) = e42h ~ D, Ne = D 


Based on [8.17], we have: 
Y(h) ~ NI (Ni2 — DioD No = Yi [8.24] 
Considering definition Y (t), we have: 
Yeqi = 01Y, 82 [8.25] 


a recurrence relation which gives the sought trajectory by addition of initial condition 


C. 


8.4. Stability, stiffness and integration horizon 


The simulation context is by definition to simulate reality. The reality manages 
limited quantities and, consequently, the differential equations that we simulate are 
dynamically stable when they must be calculated on high time horizons. On the con- 
trary, the dynamically unstable equations can only be used on very short periods of 
time, in direct relation to the speed with which they diverge. Let us go back to the 
previous situation — by far the most frequent one. Let us exclude for the time being the 
presence of a complex integrator (zero pole) and let us deal with the asymptotically 
stable case, i.e. when all the poles are of strictly negative real part. The experimental 
duration of a simulation is naturally guided by the slowest time constant Ty of the 
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signal or its cover (if it is of the damped oscillator type). On the other hand, the con- 
straint on the integration interval [8.9] will be in direct relation with the slowest time 
constraint Tm (of the signal and its cover). Let us recall that: 


—1 
T = —, Re(A) <0 8.26 
Rao) ae [8.26] 
where À designates an eigenvalue, or pole of the system, and T the corresponding time 
constraint, and that, on the other hand, for a matrix A: 


|| Al] > max |; [8.27] 


It is clear that we are in a situation where we want to integrate in a horizon that is 
as long as Ty is high, with an integration interval that is as small as T, is low. This 
relation between the slow and fast dynamics is called stiffness. 


DEFINITION 8.1. We call stiffness of a system of asymptotically stable linear differ- 
ential equations the relation: 
Tu Re(Am) 
SS SS 8.28 
Te REO) [8:28] 
where Am and Xm, are respectively the poles with the highest and smallest negative 
real part, of absolute value. 


NOTE 8.4. For standard linear systems [8.1], the poles are directly the eigenvalues of 
A. For Sylvester equations [8.13], the poles are eigenvalues of M = In, 9 A41 +AF @ 
In,» i.e. the set of pairs À; + u; where À; and u; are the eigenvalues of A; and Ag. 


The stiff systems (p >> 100) are by nature systems which are difficult to numeri- 
cally integrate. The higher the stiffness, the more delicate the simulation becomes. In 
such a context, it is necessary to have access to dedicated methods, making it possible 
to get over the paradoxical necessity of advancing with very small integration inter- 
vals, which are imposed by the presence of very short temporal constants, even when 
these fast transients disappeared from the trajectory. 


However, these dedicated techniques, which are fundamentally designed for the 
non-linear differential systems, remain incontrovertible in the stiff linear case. In fact, 
in spite of their closely related character, they represent algorithms as highly efficient 
as the specific exact diagrams of the linear case, previously analyzed. 


8.5. Non-linear differential systems 


8.5.1. Preliminary aspects 


Before directly considering the calculation algorithms, it is useful to introduce a 
few general observations. Through an extension of the notations introduced at the 
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beginning of this chapter, we will deal with equations of the form: 


x (t) = f(x,t), (to) = zo [8.29] 


Here, we have, a priori, x, f € R”. However, in order to present the integration 
techniques, we will assume n = 1. The passage to n > 1 remains trivial and essen- 
tially pertains to programming. On the other hand, as we indicated in the introduction, 
we will continue to consider only the problems with given initial conditions. However, 
the question of uniqueness can remain valid. For example, the differential equation 
t= x /t presents a “singular” point in t = 0. In order to define a unique trajectory 
among the set of solutions x = at, it is necessary to impose a condition in to # 0. The 
statement that follows provides a sufficient condition of existence and uniqueness. 


THEOREM 8.1. If (t) = f(x, t) is a differential equation such that f (x,t) is contin- 
uous on the interval [to, ts] and if there is a constant L such that | f(x,t)— f(x*,t)| < 
Lix — x*|, Vt € [to,ts] and Yx, x*, then there is a unique function x(t) continuously 


differentiable such that x (t) = f(a,t), x(to) = Xo being fixed. 





NOTE 8.5. We note that: 
— Lis called a Lipschitz constant; 
— f(x, t) is not necessarily differentiable; 
— if Of /Ox exists, the theorem implies that |3 f /ðx| < L; 
— if Of /Ox exists and |Of /Ox| < L, then the theorem is verified; 


— written within a scalar notation (n = 1), these results are easily applicable for 
n > 1. 


We will suppose in what follows that the differential equations treated verify this 
theorem (Lipschitz condition). 


8.5.2. Characterization of an algorithm 


From the instant when trajectory x(t) remains formally unknown, only the approx- 
imants of this trajectory can be rebuilt from the differential equation. On the other 
hand, the calculations being done with a finite precision, we will interpret the result of 
each calculation interval as an error-free result of a slightly different (disturbed) prob- 
lem. The question is to know whether these inevitable errors will or will not mount up 
in time to completely degenerate the approached trajectory. A first response is given 
by the following definition. 


DEFINITION 8.2. An algorithm is entirely stable for an integration interval h and for 
a given differential equation if an interference ô applied to estimation x, of «(tn) 
generates at future instants an interference increased by ô. 
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An entirely stable algorithm will not suffer interferences induced by the finite pre- 
cision of calculations. On the other hand, this property is acquired only for a given 
problem. In other terms, such a solver will perfectly operate with the problem for 
which it was designed and may not operate at all for any other problem. It is clear that 
this property is not constructive. Here is a second one that will be the basis for the 
design of all “explicit” solvers, to which the so-called Runge-Kutta incontrovertible 
family of diagrams belong. 


Initially, we introduce the reference linear problem: 
z= àz, AEC [8.30] 


DEFINITION 8.3. We call a region of absolute stability the set of values h > 0 and 
à € C for which an interference 6 applied to the estimate x, of x(t,) generates at 
future instants an interference increased by ô. 


We substituted a predefined non-linear system for an imposed linear system. The 
key of the problem lies in the fact that any unknown trajectory x(t) can be locally 
estimated by the solution of [8.30], x(t) = ae”t, on a time interval depending on 
the precision required and on the non-linearity of the problem to solve. This induces 
calculation intervals À and the faster the trajectory varies locally, the lower these cal- 
culation intervals are, and vice versa. 


We will continue to characterize an integration algorithm by now specifying the 
type of approximation errors and their magnitude order according to the calculation 
interval. To do this, we will use the following notations, with an integration interval h, 
supposed constant for the time being: 


[8.31] 


tn = nh, to = 0 
£n approximation of x(t») 


DEFINITION 8.4. We call a local error the error made during an integration interval. 


DEFINITION 8.5. We call a global error the error detected at instant t, between the 
trajectory approached £n and the exact trajectory x(t»). 


Let us formalize these errors, whose role is fundamental. Let t,, be the current 
instant. At this instant, the theoretical solution is x(t,,) and we have an approached 
solution £n. It is clear that the global error en can be evaluated by: 


En = Ln — Ltn) [8.32] 
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Now, let us continue with an interval h in order to reach instant {,,1. At instant 
tn, Tn can be considered as the exact solution of the differential equation that we 


solve, but with another initial condition. Let un (t) be this trajectory, solution of Un= 
f (un, t), with by definition un (tn) = £n. If the integration algorithm made it possible 
to solve the differential equation exactly, we would have, at instant tn+1, Un(tn41)- 
In reality, we obtain x, +1. The difference between these two values is the error made 
during a calculation interval; it is the local error: 


dn = In4+1 — Un(tn41) [8.33] 


There is no explicit relation between these two types of error. Even if we imagine 
that the global error is higher than the local error, the global error is not the accumula- 
tion of local error. The mechanism connecting these errors is complex and its analysis 
goes beyond the scope of this chapter. On the other hand, it is important to remem- 
ber the next result, where expression O(h) must be interpreted as a function of h for 
which there are two positive constants k and ho, independent from h, such that: 


|O(A)| < kh, VA] < ho [8.34] 


THEOREM 8.2. For a given integration algorithm, if the local error verifies dn = 
O(h?*1), then the global error has a magnitude order given by en = O(h?), p € N. 


NOTE 8.6. The operational algorithms have variable intervals; in this case, the magni- 
tude order of the global error must be taken with an average interval on the horizon of 
calculation considered. In practice, the conclusion remains the same. The global error 
is of a higher magnitude order than the local error. 


Since the integration interval is intuitively small (more exactly, the product hA) to 
obtain high precision, it is legitimate to think that the higher p is [8.32], the better the 
approximant built by the solver will be. This reasoning leads to the following defini- 
tion. 


DEFINITION 8.6. We call an order of an integration algorithm the integer p appearing 
in the global error. 


Therefore, we tried building the highest order algorithms, in order to obtain by def- 
inition increasing quality precisions, for a given interval. Reality is much less simple 
because, unfortunately, the higher the order is, the less the algorithms are numerically 
stable. Hence, there is a threshold beyond which we lose more — due to the finite 
precision of calculation — than what the theory expects to gain. It is easy to realize 
that the order of solvers rarely exceeds p = 6. There are two key words to classify 
the integration algorithms into four categories: the algorithms are “single-interval” or 
“multi-interval” on the one hand and on the other hand “implicit” or “explicit”. We 
will limit ourselves here to “single-interval” explicit algorithms and we will finish 
with the implicit techniques in general. 
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8.5.3. Explicit algorithms 


Explicit algorithms are the family of methods that are expressed as follows: 


t = hf (en + Ziz bijUj tn + aih), à = i ee eee [8.35] 


r 
Tn+1 — Tn + DA CiVi 


The algorithm is explicit as long as a v; depends only on with 7 < i. Finally, it is 
at one step because x,,,1 only depends on zn. Parameter r represents the cost of algo- 
rithm calculation, measured in the number of times where we evaluate the differential 
equation during an integration interval. The order of the method is a non-decreasing 
function of r. The triangular matrix B = [b;;] and vectors a = [a;] and c = [c;] are 
the algorithm parameters, chosen with the aim of creating the highest possible order 
diagram that is also the most numerically stable. Euler’s 1% order method, and more 
generally all Runge-Kutta type algorithms, meet the formulation [8.35]. We still have 
to determine the field of absolute stability of a p order algorithm. This stability is 
judged by definition on the reference equation, which is parameterized by dynamics 
À. From the current point (tn, £n), the exact value at the next instant t,+1 will be 
e*",,. For a disturbance applied to instant t, to be non-increasing in the future, the 
condition is simply |e*”| < 1. If the algorithm is of p order, this means that eò” is 
approached by its serial development in Taylor series at p order. The stability domain 
described in the complex plane u = Ah is then defined by: 


Zz 


For Euler’s method, p = 1, we find the unit circle centered in p = —1. What is 
remarkable is that the stability field of an explicit algorithm does not depend on the 
formulae used to implement it (here B, a, c), but directly on the order that character- 
izes it! In a system of non-linear equations, the role of À is kept by the eigenvalues 
of Jacobian Of /Ox; on the other hand, the magnitude order for the local and global 
error is, by definition, guaranteed only for a value of u belonging to the stability field 
of the method [8.36]. The constraint on the integration interval is thus operated by the 
“high” (absolute value of negative real value), i.e. by fast transients, even if they 
became negligible in the trajectory x(t)! This is the fundamental reason as to why an 
explicit method is not applied in order to integrate a stiff system. 


< 1 [8.36] 





Among all possible diagrams, we have chosen one corresponding to the best com- 
promise between performance and complexity, and present in all the best libraries. It 
is the Runge-Kutta-Fehlberg solver, whose particularity is to simultaneously offer a 5 
and 4 order estimation, for the same cost (r = 6) like the more traditional diagram 
of 5 order only. From this situation we will have a particularly competitive automatic 
management of the integration interval, based on the idea that when we are in the 
domain of absolute stability, the 5 order estimate can be, with respect to 4 order, the 
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exact trajectory for estimating the local error. In addition, it is thus possible to verify 
the compatibility of its magnitude order with what the theory expects. The parameters 
of this solver are: 


0 16/135 25/216 
1/4 0 0 
3/8 6,656/12,825| ,  |1,408/2,565 
a= |12/13]? ©~ |28,561/56,430|’ © ~ |2,197/4,104 
1 —9/50 -1/5 
1/2 2/55 0 
0 0 0 0 0 0 
1/4 0 0 0 0 0 
po 3/32 9/32 0 0 0 0 
= 11,932/2,197 —7,200/2,197  7,296/2,197 0 0 0 
439/216 =: 3,680/513 —845/4,104 0 0 
—8/27 2 —3,544/2,565 1,859/4,104 —11/40 0 


Here, parameter c* is the second value of c, leading to a 4 order estimation, the 5 
order estimation being provided by c. 


8.5.4. Multi-interval implicit algorithms 


The complexity of these techniques is another matter [LON 95]. Firstly, let us 
consider the implicit version of single-interval methods, which are directly obtained 
from the explicit case [8.35]: 


[e = hf (za + Eja bytta tih), i= 1,2 [8.37] 


7 
En = En + Vijay Gii 


As long as v; depends now on itself, its calculation implies the solving of a non- 
linear (static) system, precisely the one defining the differential equation to be solved. 
To simplify the future notations, we say: 


fai = F Enta) [8.38] 


A multi-interval method will be then written: 
nti = X Ainii th >. Bifnss—i [8.39] 
i=1 i=0 


The method is implicit for 69 Æ 0 and explicit for So = 0. Apart from the difficulty 
related to the implicit case already mentioned, a multi-interval algorithm cannot start, 
due to its reference to a past that does not exist at the beginning of the simulation. The 
first points are thus always calculated by a single-interval method. 
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Due toits particular context, solving non-linear systems intervening in the implicit 
structures is not done as a priority by a standard solver but rather by a specific approach 
consisting of using in parallel an explicit diagram, with the role of predictor, and an 
implicit diagram, called a corrector. In these two phases, we usually add a third one, 
called an estimator. By noting these stages P, C and E, each calculation interval is 
built on a P(EC)™ E type structure that we interpret as being constituted of a predic- 
tion initial phase followed by m estimation-correction iterations and by an estimation 
final phase. This leads to the general diagram: 


Peels = hy Gite i th Diy Bifnsi— k= 0 
as long as k < m 
k k 
E: A = E 8.40 
AH) _ yor (k) r (k) B [8.40] 
C: tapi = Dis ifp pii th ino Bifnta ns R=R+1 
as long as in the end 


E : Zn+1 = a fn+1 = FL tes) 





The number of m iterations is often imposed a priori or obtained from a conver- 
gence criterion on eP = a), If we consider formula [8.39], we have 2r + 1 
degrees of freedom. Hence, we are capable, by choosing correctly the parameters of 
the method (the à; and (;), of building an exact solver for the polynomial trajecto- 
ries x(t) of a degree inferior than or equal to 2r. From this we obtain a local error 
in O(h?"*1), i.e. a method of order 2r — therefore much more than what a single- 
interval explicit method could have expected. Unfortunately, the diagrams of 2r order 
are numerically unstable — there is phase difference (EC). We prove that it is impos- 
sible to build numerically stable algorithms of an order greater than r + 1, for r odd, 
and of order greater than r + 2, for r even. 


The field of absolute stability is always characterized from the reference equation 


t= Xx. Based on [8.39], by introducing an ag = —1, we can rewrite this relation as: 
5 Qiln+1—i + hX Bifn+1-i = 0 [8.41] 
i=0 i=0 


Applied to our reference linear equation, always with u = hA, we get: 


T 


X (ai + Bib) en41-i 


i=0 


If we apply interference 6, to £n, the ôn are governed by the same recurrence and 
thus evolve as zj’, where z; is a root of the polynomial: 


p(z) =) az”? +p piz [8.42] 
i=0 i=0 
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Consequently, the field of absolute stability of multi-interval methods (implicit or 
not depending on the value of 6o) is the set of u € C so that the roots of the polynomial 
[8.42] verify |z;| < 1. It is important to know that the field of absolute stability of 
implicit methods is always larger (often 10 times more) than that of explicit methods 
of the same order — hence their interest in spite of their high complexity. On the other 
hand, the more the order increases, the more the field is reduced. Hence, the designer 
will again have compromises to make. Many diagrams were suggested in books and it 
is obviously impossible to try to make a synthesis. For example, we chose the implicit 
Adams-Moulton method. This strategy corresponds to the following parameters, for 
r=1,...,6 with a; = 0 (i = 2,...,r), a, = 1 and ĝ;, according to the following 
table: 


FS) Mer nes ers 4 r=5 r=6 
1 1/2 5/12 9/24 251/720 475/1,440 
1/2 8/12 19/24 646/720 1,427/1,440 
1/12 5/24 264/720 —798/1,440 

1/24 106/720 482/1,440 
—19/720 —173/1,440 

27/1,440 











In all cases, the order of the method is r + 1. The interval management is done 
along with a selection of the order within the possible range, here r < 6. This consists 
of comparing the magnitude order of the local error theoretically stipulated and with 
that estimated with the help of devised differences and then adapting the interval. The 
order is selected in such a way as to maintain the highest interval, while remaining 
in the field of absolute stability. The professional codes are accompanied in reality 
by heuristic methods which are often very sophisticated and the result of long expe- 
rience, which give these tools the best compromise between cost, performance and 
robustness. As long as the stability field continues to intervene, there is nothing solved 
with respect to the stiff systems. Then, what do these techniques bring with respect to 
single-interval explicit methods, with a much less complicated design? They poten- 
tially offer better performances, in terms of the cost/order ratio, a better flexibility, the 
variation of the interval and order at the same time, and especially a chance to get 
away from this field of absolute stability — due to which it will finally be possible to 
integrate these stiff systems. 


8.5.5. Solver for stiff systems 


For non-linear systems, stiffness is always defined by [8.28], but this time the À 
are the eigenvalues of the Jacobian Of /Ox. This means that stiffness is a characteris- 
tic of the differential system, variable in time! A specific notion of stability had to be 
introduced in order to deal with this particular case, which is extremely frequent in the 
industrial applications of simulation. It is the S-stability (“sziff” stability). S-stability 
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is expressed in terms of absolute stability for the area of the complex plane ju defined 
by Re(u) < d < 0 and a precision constraint in the area {d < Re(u) < a,a > 0; 
m(yu)| < c}. The subtlety of the approach lies in the fact that no particular precision 
is required for the area Re(y) < d < 0, since it is de facto acquired by the choice of 
parameter d. In fact, always with respect to the local reference equation of dynamic À, 
when a fast transient would impose a very small interval, we verify that it has become 
negligible at the following instant. We have |e”| = |e*”| < ed, Conversely, a has the 
potential increase of the trajectory and c is there to express that in oscillating phase, a 
minimum of points are necessary to follow, with a given precision, a pseudo-period. 
Gear [GEA 71], who is at the origin of all the developments for the integration of 
stiff systems, proved that we could build S-stable single-interval implicit algorithms 
for 2 < r < 6. The counterpart is that solving a non-linear system by the phase 
(EC)™ [8.40] was not convergent anymore, due to the interval increase authorized by 
the S-stability and precisely forbidden by the absolute stability to which this diagram 
referred. This time, we must solve the non-linear system through a more traditional 
approach, like Newton type, i.e. by calculating the Jacobian of the system of differen- 
tial equations 0 f /Ox and then by solving the system that it induces in order to obtain 
the correction whose role was held by phase C’. Gear’s diagrams are summed up in 
the following table, which refers to the general relation [8.39], by noting that only 6o 
is non-zero: 


k 2 3 4 5 6 

Bo 2/3 6/11 12/25 69/137 60/147 
a, 4/3 18/11 48/25 300/137 360/147 
a, —1/3 —9/11 —36/25 -300/137 —450/147 





a3 2/11 16/25 200/137 400/147 Bae] 
a4 -3/25 —75/137 —225/147 

as 12/137  72/147 

a6 —10/147 


8.5.6. Partial conclusion 


The single-interval explicit methods are by far the simplest. On the other hand, the 
interval automatic management represents, in the majority of cases, their weak point. 
This difficulty is intrinsically linked to the fact that we have only a priori a given 
order diagram — without a Runge-Kutta-Fehlberg algorithm presented. On the con- 
trary, the multi-interval methods are compatible with the interval automatic manage- 
ment, because they easily offer multiple order diagrams. However, they cannot operate 
without using single-interval methods. They are equally more delicate to implement, 
due to the iterative aspect that characterizes them (implicit form). When, in the course 
of integration, stiffness increases, the solver must call on Gear’s parameters, with the 
necessity of using the Jacobian of the system in a Newton type iterative diagram. It is 
clear that, for a difficult problem, there is no viable simple solution. 


244 Analysis and Control of Linear Systems 


If one is not a specialist in the field, it is strongly advisable to use the calculation 
databases dedicated to this type of problem [SHA 97]. In any case, the objective of this 
chapter was not to make the reader become a specialist, but to make him an adequate 
and critical user of tools that he may have to use. 


8.6. Discretization of control laws 
8.6.1. Introduction 


A very particular case of numerical simulation consists of implementing the con- 
trol algorithms on the calculator. There are actually several methods of doing the syn- 
thesis of a regulator. À simple method consists of experimentally determining the 
discrete transfer function of the system, as we saw in the previous chapter. It is then 
natural to directly calculate a discrete control and to implement it as such on the cal- 
culator in real-time. The other approach consists of starting with a continuous model 
experimentally obtained, or from the knowledge model. Then we can discretize this 
model (z transform, discretization of state representation) and we find ourselves in the 
previous situation. We can also choose to delay the discretization until the last moment 
in order to benefit from all the know-how of the continuous control. Then we calcu- 
late a continuous regulator that will have to be simulated on the control calculator in 
real-time by a difference equation. In this last case, we generally choose to have a low 
sampling period (with respect to the dynamics of the procedure) and we generally use 
very simple simulation algorithms that we could even call simplistic! We will mention 
some of them in this section. 


NOTE 8.7. In order to take into account the presence of the pair zero order blocker/ 
sampler in the sampled loop, it is advisable to approach it by a pure delay of a sampling 
half-period e7 2° or by the transfer [BES 99]: 


B(s) = (1 — >) [8.44] 

It is clear that this transfer is negligible as soon as the frequency corresponding to 
the sampling half-period is placed in a sufficiently high frequency band with respect 
to the transfer cross-over frequencies of the system. However, it makes it possible 
to consider the phase difference brought about by the presence of the blocker and 
explains that the results obtained with the numerical regulator are sometimes different 
from those obtained with the continuous regulator. 


8.6.2. Discretization 


The continuous regulator is often obtained in the form of a transfer function, 
describing a differential equation. We seek to replace, in this equation, the differ- 
entiation operator by a numerical approximation. Depending on the approximations 
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chosen for representing the differentiation (and hence the integration), we find various 
difference equations to be programmed in the calculator. 


V 


oo 











Figure 8.2. Inferior rectangle method 
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Figure 8.3. Trapezoid method 


Let us consider first the approximation of a signal derivative y(t) by a rear differ- 
ence calculation: 


dy(t cT — Y(k- 
y(t) x YkT — Y(k-1)T [8.45] 
dt [ikr T 
This derivation method corresponds to the approximation of an integral calculation 


by the technique known as the superior rectangle method or Euler’s first method [BES 
99], which is illustrated in Figure 8.1: 


Ikr = Ln-1yr + TYkT [8.46] 
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We find a second Euler’s method, which is called an inferior rectangle method 
(Figure 8.2). It is based on the rear difference calculation for the derivative: 


dy(t) me UT — YT [8.47] 
dt Ver T | 
Ter = LT + Ty(-1)T [8.48] 


We finally find an approximation known under various names: Tustin’s approxima- 
tion, trapezoid approximation or Padé approximation. This last calculation is equiva- 
lent to the integration by the trapezoid method (Figure 8.3): 


Ter = Ir + Y-T + EYT [8.49] 


We will now deal with the relations that these approximations impose between 
the continuous and numerical transfer functions of the regulator. We know that the 
continuous derivation corresponds to the Laplace operator s. As for the delay of a 


sampling period within a difference equation, it is represented by the operator z271. If 
123 
T 





we seek the z transform of equation [8.45], we find Y (z). Therefore, we can 
conclude that each time we find a derivation in the time equation of the regulator, i.e. 
the operator s in its transfer function, we will have, in the sampled transfer function, 
the operator = 





= 8.50 

s= Ts [8.50] 

Under these conditions, it is easy to deduce, from the continuous transfer function 

of the regulator, the discrete transfer function, which can be then used in order to 

find the difference equation simulating the regulator numerically. Therefore, we will 

easily verify that the approximation by front difference (equation [8.47]) returns to the 
substitution: 





z—1 
= Ol 
s T [8.51] 


and the trapezoid method [8.49] to the substitution: 





2z—1 
3S 
Tz+1 





[8.52] 


The approximation of the inferior rectangle does not maintain the stability of the 
continuous transfer function that is digitized. In fact, the transformation (equation 
[8.51]) transposes the left half-plane of plane s into an area in the poles plane in z 
which goes beyond the unit circle (see Figure 8.4). For this reason, this is a little used 
method. 


With the transformation of the superior rectangle [8.50], the left half-plane in s is 
transposed into plane z into an area situated within the unit circle (Figure 8.5). 
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À Im(s) À Im(z) 
Re(s) Re(z) 
p= i > 
z=1+sT 








Figure 8.4. Transformation of the inferior rectangle 


For this reason, this method is preferred to the approximation of the inferior rect- 
angle. We can also note its advantage with respect to the latter in the calculation of the 
integral [8.46], which makes the value of the magnitude integrate at instant k and not 
at instant (k — 1) as in the equation [8.48]. 











A Im(s) À Im(z) 
Re(s) Re(z) 
po p 
Z= 1 
1-sT 








Figure 8.5. Transformation of the superior rectangle 


Finally, we note that Tustin transformation transposes the left half-plane s within 
the unit circle in plane z, which guarantees the same stability properties before and 
after the transformation (Figure 8.6). We have the same transformation of the complex 
plane as with the theoretical value z = e7*, whose [8.52] is precisely a 1% order Padé 
approximant. 


8.6.3. Application to PID regulators 


With these transposition tools, if the continuous regulator is given as a transfer 
function, it is enough to choose the approximation desired and to perform the corre- 
sponding substitutions. Let us take the example of the classic PID regulator. Let e(t) 
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Figure 8.6. Trapezoid transformation 


be the displacement signal at its input and u(t) the forward signal at its output. Its 
transfer function is: 


1 + sTa 


U(s) = K 
(s) = sT; 1l+s 


1+ 








E(s) [8.53] 








where Kp, Ti, Ta and N represent the setting parameters of the regulator. By using 
the approximation by the superior rectangle method, the integral term becomes: 























1 T z 
= ——_ 8.54 
sT; T; z—1 l l 
The calculation of the filtered derivative gives: 
Ta z—1 
T= — 8.55 
sla = Tr. [8.55] 
Ta Td z—1 
1 — = 1 + — | 
+s N + NT > [8.56] 
whose ratio is: a i 
7 [8.57] 
and equation [8.53] becomes: 
U T T, — 1 
UNG) | te % . [8.58] 
E(z) Tiz-1 T 24+ y4(z-1) 
that we can write in the standard form: 
T zZ Tad z—1 
Kp} . : 
>| ae T = [8.59] 
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with: 
Ta 
y= rae +, [8.60] 
and: 
Tad NTa 
— = ——_ 8.61 
T NT + Ta [ l 


We still need to write the difference equation corresponding to this transfer in order 
to have the programming algorithm of the numerical PID. We can, for example, set 
equation [8.59] at the common denominator: 

roz? +riz tre roz? +rız+r2 


FA C-D-  2+s2+9 [8.62] 








with: 
er F ý [8.63] 
n=l+y+ pr + e [8.64] 
Re a [8.65] 


and the difference equation is expressed as: 
UkT = —$1U(k-1)T — S2U(k—2)T + TOCRT + T1e(k—1)T + T2€(k-2)T [8.66] 


It is often suggested to separate the three terms — proportional, integral and deriva- 
tive — in the coding, which leads to create three intermediary actions up, u; and uq, 
that we sum up to obtain the global action: 


Up(z) = KpE(2) 
U;(z) = Kp Ele) [8.67] 


Ua(z) = K} =2E(2) 





Up kt = Kperr 
Ui kt = Ui (k-1)T + Kprerr [8.68] 
Ud kt = Yua (k-1)T + KP LE (eer — e-r) 


This encoding enables us to act separately on each action; we can, for example, 
disconnect the derived action if we prefer a PI to a PID; and we can also limit the 
integral action in order to prevent it from saturating the actuators (antireset windup). 
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In expression [8.68], we saw that the action derived will depend on the error vari- 
ation on a sampling period. If the latter is very low, it is possible that the variance 
on the error becomes of the same magnitude order as the noise, the rounding errors 
or the quantization errors. Thus, it is not reasonable to have too small a sampling 
period. Another comment should be made on the numerical realization of the integral 
term, according to the variance multiplied by the sampling period. When we are too 
close to the reference, the variance becomes low and, if the sampling period itself 
is low, the correction of the integral action may become zero if it is inferior to the 
quantization threshold. Therefore, we can notice a static error which is theoretically 
impossible when we have an integrator in the direct chain of control. One solution 
can be to increase the length of words intervening in the calculations of the integral 
action. A second solution consists of storing the part of er not taken into account 
after the product by E in order to add it to the value e(,41)7 of the next sampling 
period [LON 95]. 
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System Control 
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Chapter 9 


Analysis by Classic Scalar Approach 


9.1. Configuration of feedback loops 
9.1.1. Open loop — closed loops 

The block diagram of any closed loop control system (Figure 9.1) consists of an 
action chain and of a reaction (or feedback) chain which makes it possible to 


elaborate an error signal e(t), the difference between the input magnitude e(t) and 
the measured output magnitude r(t). The output of the system is s(t). 


Output s(t) 





Input e(t + Difference g(t 
pas © 
Feedback chain 


Figure 9.1. Block diagram of a feedback control 





When the system is subjected to interferences b(t), its general structure is 
represented by Figure 9.2 by supposing that its working point in the direct chain is 
known. We designate by E(p), R(p), &p) and S(p) the Laplace transforms of the 
input, the measurement, difference and the output respectively (see Figure 9.2). 
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E(p) + e(p) e 








R(p) 


Figure 9.2. General block diagram 





The open loop transfer function of this chain is the product of transfer functions 


of all its elements; it is the ratio: 


R(p) _ 
20) = 4 (Ply (p)B(p) 


with: 





The closed loop transfer function of this chain is the ratio S(p) 
P 


E(p)= E(p)-R(p) 
We have: 
S(p) = 4 (p)(B(p) + 4 (p) €p)) 


and: 


R(p) = B(p)S(p) 


[9.1] 


[9.2] 


[9.3] 


[9.4] 


by using equation [9.2] and by eliminating R(p) and ap) of equations [9.3] and 


[9.4], we obtain: 


4 (P) My (p) E(p)+ Lh (p) 
1+ 4 (pu (p)B(p) 1+ 4y (pu (PAP) 








S(p)= B(p) 


[9.5] 
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When the interferences are zero, B(p) =0, the transfer function of the looped 
system is then: 





Sip) (PH (P) [9.6] 

E(p) 1+4 (PPEP) | 
or simply: 

S(p)__ Mp) [9.7] 





E(p) 1+4(p)B(p) 


by supposing that u(p) = 44 (p)42 (p). 


The transfer function with respect to the interference input is obtained by having 
E(p)= 0 in equation [9.5]: 


S(p) _ l (p) 
B(p) 1+4 (p) (P)A(P) 





[9.8] 


9.1.2. Closed loop harmonic analysis 


Bandwidth 


The bandwidth of a system is the interval of angular frequencies for which the 
module of open loop harmonic gain is more than 1 in arithmetic value: 


GDL >> 1 [9.9] 


Approximate trace 


In order to simplify the determination of closed loop transfer functions, we can 
use the approximations [9.10] and [9.11]: 


So) _ 1 
Eljo) Po) 








If |u(ja) B(jo)| >>1 then [9.10] 


SUO) _ 


If [u(jæ) B(jw) <<1 then EGO 


U(jo) [9.11] 
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Figure 9.3 shows, in Bode plane, the approximate trace (in full line), of the gain 
curve of the closed loop frequency response. 


~ |HGo)| 
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Figure 9.3. Approximate trace of the closed loop harmonic response 






Point A, in particular for integrator systems, is often rejected at © =0. 


9.1.2.1. Black-Nichols diagram 


The Black-Nichols diagram makes it possible to graphically pass from the open 
loop system to the closed loop system. This diagram corresponds to a unitary 
feedback. The chart in Figure 9.4 is usable for open loop gains going from —40 dB to 
+40 dB and a phase difference between 0° and —360°. 


40 










304- ue } 





Module of open loop (dB) 





0 l 
-350 -300 -250 -200 -150 -100 -50 0 
Phase of open loop (°C) 
Figure 9.4. Black-Nichols diagram 
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When the feedback is not unitary, the transfer function can be re-written as a 
unitary feedback gain transfer function that is divided by the return gain Ø because: 


u _ u1 
OST eB B [9.12] 








9.1.2.2. Estimation of closed loop time performances from the harmonic analysis 


The closed loop (CL) frequency response is characterized by the quality 
factor Q,., also called magnification Q, i.e. the passage from the module through a 
maximum to an angular frequency æ, called resonance angular frequency. 


The time response is characterized, for a step function input, by the time of the 
first maximum ź„ and the overflow D, as indicated in Figure 9.5a, i.e. 
D = (Smax —Soo)/Sco Where Smax represents the maximum value obtained from the 
output to instant ¢,, and Sœ that obtained in permanent state. The overflow is 
expressed as a percentage. 


s(t) À 















0 dB 








Figure 9.5. (a) Time response in CL and (b) frequency response in CL 
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When the system has good damping, let é be the value of the damping 
coefficient delimited between 0.4 and 0.7, we have the relation @,t,, =3, where 
@, represents the gap angular frequency. It is the angular frequency for which the 
open loop arithmetic gain is equal to the unit. For a well damped system, the quality 
factor Q, has a value of less than 3 dB. 


9.2. Stability 


A looped system is called stable if its transfer function: 


Fip- — 4) 9.13 
WS T fa 


does not have poles of positive or zero real part. 


In other words, the necessary and sufficient condition of stability of such a 
system is that F(p) has all its poles with a negative real part. 


When the denominator of F(p) is a polynomial of order higher than 3 and does 
not reveal any obvious root, the analytical calculation of the roots may be fastidious. 
To study the stability, we then use either the geometrical criterion called Nyquist, 
where we reason only on the open loop in order to determine the stability of the 
closed loop or the so-called Routh algebraic criterion where we reason on the F(p) 
specific equation without calculating its roots. 


We can firstly show that the stability of a linear closed loop control system is 
connected to the diagrams of its open loop frequency response. 


The transfer functions #(p) and (p) are in general in the form of polynomials 
inp: 











N] (p) N3 (p) 
= d = 9.14 
uCp) A and (p) D (p) [9.14] 
then: 
N D 
FE 1) 2(P) [9.15] 


Dı (P)Do(P)+ Ny (P)N 9 (p) 
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The system’s characteristic equation is: 
Dı (P)D7(P)+ Ny (P)No (Pp) = 0 [9.16] 
or: 


1+ u(p)B(p) = 0 [9.17] 


The system is stable if equation [9.17] does not have zeros of positive or zero 
real part. 


NOTE 9.1.- the methods presented in this chapter are valid when the open loop 
transfer function “(p){(p) does not result from a set of transfer functions 
presenting simplifications of the poles-positive real part zeros type. 


9.2.1. Nyquist criterion 


This criterion is based on the traditional property of analytical functions and it 
makes it possible to predict the behavior of a looped system by only knowing the 
open loop. To do this, we use the following Cauchy’s theorem. 


When a point M of affix p describes in the complex plane a closed contour C 
(Figure 9.6a), clockwise, surrounding P poles and Z zeros of a function A(p) of the 
complex variable p, then the image of the point M through application A surrounds 
N=Z-P times the origin in the same direction. We suppose that there is no 
singularity on C. 


If we take, for example, Z =2 and P=3, then N =-1, the point M makes 1 
tour around the origin, in counterclockwise direction (Figure 9.6b). 





Figure 9.6. Plane of the complex variable p and plane of A(p) 
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The application to the Nyquist criterion leads to consider the transformation 
A(p) as being the denominator of the transfer function of the looped system. We 
want this transfer function not to have poles of positive real part and hence its 
denominator: 


A(p) =1+4u(p)#(p) [9.18] 


not to have positive real part zeros. We then choose as contour C a semicircle of 
infinite radius in the complex semi-plane on the right of the imaginary axis. C is 
called the Nyquist contour. The image of C through A(p) transformation must thus 
surround the origin, in a counterclockwise direction, as many times as the number of 
unstable poles of equation [9.18] and hence of #(p)B(p) . 


Im À Im À 
R> œ c 1 + up 


VA 











a) b) 


Figure 9.7. Contour and image of the Nyquist curve 


Contour C is chosen in such as way as to surround the poles of possible zeros of 
1+ “(p)f(p) with strictly positive real part. If C contains, for example, Z =1 zero 
and P=3 poles (Figure 9.7a), the Nyquist diagram will make N=-2 circuits 
around the origin in a clockwise direction and it will go around twice in a 
counterclockwise direction (Figure 9.7b). 


To be stable in closed loop, it is necessary that z = 0, the image of C must then 
make N = — P circuits around the origin. If the open loop transfer function u(p) (p) 
is stable, we have P =0 , the image of C through the transformation 1+ “(p){(p) 
should not surround the origin. However, the number of circuits made around the 
origin in the transformation 1+ 4(p){(p) is equal to the number of circuits made 
around the critical point —1 in the transformation “(p)f(p). 


In what follows, we will deal only with this latter transformation and we will 
study the case of open loop stable systems, that of integrator systems and finally the 
case of open loop unstable systems. 
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Case 1. Open loop stable system: Nyquist diagram 
Let us take the example: 


K 
p+a)(p+b)(p+c) 





Mp) B(p) = ( [9.19] 


where a, b and c are positive. Contour C and the Nyquist image corresponding curve 
are given by the graphs in Figure 9.8. 


In À In À 
R— œ C up 

















a) b) 


Figure 9.8. Contour and image of Nyquist curve 


The image of the semicircle of infinitely high radius, through this transformation, 
is reduced to a point, which in general is the origin (the systems that can be 
physically created always have zero gains for infinite angular frequencies). 


The image of the radius ]0*,+c[ is the curve in Figure 9.8b, corresponding to 
the trace of the open loop frequency response and described in the direction of 
increasing. The image of the ray ]—c,0 [ is the symmetric curve of Nyquist 
place, with respect to the axis of real numbers. We can easily show this symmetry 
from the expression of the open loop transfer function. In fact, based on the 
definition: 


u(p)Blp) = | hoe" FF a0 [9.20] 
0 


where A(0) represents the impulse response of the open loop system, i.e.: 


(ja) B( ja) = THO) [cos(-w0)+ j sin(-w0)|d0 [9.21] 
0 
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we have: 

u(-jo)B(-jo) = no) [cos(w6) — j sin(w0)|d0 [9.22] 
or: 

M(- jo) (-jo) = JORO) [9.23] 


Case 2. Open loop integrator system 


Let us take the example: 


u(p)B(p) = 





io [9.24] 


where a is positive. 


Contour C is chosen in such a way as to exclude the origin as indicated by Figure 
9.9a. This contour does not contain the unstable poles (P = 0) . Let C, be the circle 
ofradiuse, p =€e/" (Figure 9.9c). 


The image of the ray ]0*,+c[ is the trace of the frequency response in open 


loop covered in the direction of increasing œ. When æ—0*, the gain of 
U(p)8(p) > œ and the phase is of -7/2. When © — +, the gain — 0 and the 


phase is of —-7. The image of the ray ]-%, 07 [ is the symmetric curve, with 
respect to the axis of real numbers. The image of the semicircle of radius € is an arc 


of circle of radiusco. When p—0, u(p)B(p) > K/ p= Klee”. If œ increases 


from 07 to 0*, @ varies from -7/2 to +7/2 and the open loop gain is infinite. 
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In À In À 


RO œ 














a) 








Figure 9.9. The choice of the contour of C excludes the origin (a), the transform 
of the contour C (b), C „circle of radius £ p = ed? (c) 


The image of the semicircle of an infinitely big radius is the origin of the 
complex plane. The transform of contour C, representing the complete Nyquist 
place, is represented by Figure 9.9b. 


NOTE 9.2. — when the open loop transfer function u(p)B(p) has terms of the form 
K/ p* with Æ>1, @ varies from —kz/2 to +k7/2 and {(p)B(p) describes 
infinite k semicircles in a clockwise direction. 


Statement of Nyquist criterion 


A looped system is stable (bounded input-bounded output) if and only if the 
Nyquist place of its open loop transfer function “(p){(p), which is described in 
the direction of increasing angular frequencies, does not go through the critical point 
— 1 and makes around it a number of circuits in the counterclockwise direction equal 
to the number of unstable poles of “(p)f(p) . 


Case 3. Unstable system in open loop 
EXAMPLE 9.1.— the Nyquist trace is given in Figure 9.10. 


Mp) B(p) = [9.25] 


p(l-2p) 
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Figure 9.10. Nyquist trace, open loop unstable system 


We have P = 1, this system, which is unstable in open loop, is equally unstable in 
closed loop because the Nyquist place does not surround once the critical point —1. 


EXAMPLE 9.2. 


Mp) B(p) = Ae [9.26] 
p~ (1+0.1p) 


The Nyquist trace is given in Figure 9.1 la. 

















a) b) 


Figure 9.11. Nyquist trace, open loop stable system (a), closed loop stable system (b) 


We have P=0; this system, which is stable in open loop, is equally stable in 
closed loop because the Nyquist place does not surround the critical point — 1. 
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EXAMPLE 9.3. 


K(1+ p}X1+0.2p) 
p2(1+5p)(1+0.1p) 





Mp) B(p) = [9.27] 


The Nyquist trace is given in Figure 9.11b. We have P =0, the stability of the 
closed loop depends on the value of K. Let œ be the meeting point of the curve with 
the axis -n (as indicated in the figure). If -1<a@<0, the looped system is stable 
because Nyquist place does not surround the point —1, if œ < -—1, the looped system 
is unstable. 


Simplified Nyquist criterion: reverse criterion 


A simplified criterion can be deduced from the previous criterion. 


A system, stable in open loop, is stable in closed loop if, covering the Nyquist 
place of the open loop in the direction of the increasing @ , leaves the critical point 
on the left (Figure 9.12a). If it leaves the critical point on the right, it is unstable 
(Figure 9.12b). If the gain curve 4f(jæ) goes through the critical point, the system 


is oscillating (Figure 9.12c). 














Figure 9.12. Stable system (a), unstable system (b), oscillating system (c) 


9.2.2. Routh’s algebraic criterion 


This criterion formulates a necessary and sufficient condition so that any n 
degree polynomial has all its roots of strictly negative real part. 


We re-write the characteristic equation [9.17] in the polynomial form: 


1+ upp) = 4,p" +a, _ yp" 1 +...+a,ptag [9.28] 
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Then, we create Table 9.1 (with n +2 rows). 























1 dn An-2 Qn-4 
2 An-1 An-3 An-5 
3 bi =An-2 -An An-3 /An-1 bz=ay.4 -An An-5 /An-1 b3=an.6 -An An-7 lan 
4 C1 —Qn-3 -b3 dn-1 /b; C27đn-5 -b3 An-1 /b, C3-Ayn.7 -b4 An-1 /b, 
n+l 0 
n+2 0 0 0 0 


























Table 9.1. Routh’s table 


For a regular system, the number of non-zero terms decreases with 1 every 2 
rows and we stop as soon as we obtain a row consisting only of zeros. The first 
column of Routh’s table has n + 1 non-zero elements for a characteristic equation of 
n order. The roots of this equation are of strictly negative real part if and only if the 
terms of this first column of the table have the same sign and are not zero. 


Statement of Routh’s criterion 


A system is stable in closed loop if and only if the elements of the first column of 
Routh’s table have the same sign. 


EXAMPLE 9.4.— let us consider again example 9.1 : 


u(p)B(p) = [9.29] 


pA-2p) 


The characteristic equation is p(i-2p)+ K =0 or -2 p’ +p+K =0. The first 
two coefficients (which are the first two elements of the first column of Routh’s 
table) are of opposite signs, the looped system is unstable. 
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Let us now take example 9.2: 


nopea 
p^ (+0.1p) 


[9.30] 


The characteristic equation is p°(1+0.1p)+K(1+p)=0 or by developing 
0,1p° +p? +Kp+K=0. 


Routh’s table gives: 


0.1 K 
1 K 
K-0.1K 0 
K 0 


The system is stable in closed loop if K >0. 


NOTE 9.3.— a necessary condition to have negative real part roots is that all 
a; coefficients have the same sign. 


9.2.3. Stability margins 


The physical systems are represented by mathematical models which are 
generally not very exact. The stability of the mathematical model does not 
necessarily entail that of the physical system. Consequently, in order to take into 
account the uncertainties of the model, security margins must be defined during the 
theoretical study in order to ensure a satisfactory behavior to the looped system, 
especially when the Nyquist place of the harmonic response in open loop is near the 
critical point. 


a) Phase margin — gain margin 


The phase margin is obtained by calculating the difference between the system 
phase considered and —180° to the gap angular frequency. The gain margin is 
obtained by calculating the difference between the system gain and 0 dB to the 
angular frequency where the phase reaches —180°. These margins, noted by Ag 
and AG , are represented in Bode, Nyquist and Black-Nichols planes in Figure 9.13. 
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[HB | as À In À 
0 dB (a) 




















-180° -90° 
c) 





Figure 9.13. Bode plane (a), Nyquist plane (b), Black-Nichols plane (c) 


In Bode and Black-Nichols planes: 

A@= ġ-(-180° ) to the angular frequency w, for which up| =0dB 

AG = 0dB- WA op to the angular frequency © = ©, for which ø = -180° 
In Nyquist plane: 


A@ is represented by the angle (ON,OM) where OM = |p| =1 (0dB) 


AG =-20logx where x= | up | to the angular frequency © =, for which 
@ = -180° 


When a system is at a minimal phase difference, i.e. when all its zeros are of 
negative real part and of low-pass type, the only consideration of the phase margin is 
enough in general to ensure a convenient damping. 


b) Delay margin — module margin 


The delay margin, for a stable system with a phase margin Ag is defined as 
being the ratio: 


Te [9.31] 
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where œ, is the gap angular frequency. The delay margin represents the maximum 
allowed delay value leading exactly to the cancellation of the phase margin. 


The module margin represents the shortest geometrical distance between curve 
uf and point —1. 





Figure 9.14. Nyquist plane 


c) Degree of stability of a second order system 


Let us consider a second degree system whose closed loop transfer function, with 
a unitary feedback, has the form: 


w2 
F(p) =—— [9.32] 
p^ +26@5p +@5 


where č represents the damping coefficient and @ọ is the system’s angular 
frequency. If é is low, the unit-step response is oscillating, if € is high, the 
response is strongly damped and the transient state is long. 


The magnification Q,., i.e. the maximal gain of the closed loop module curve, 
which can be measured directly in the Black-Nichols plane (Figure 9.15), is related 
to the damping coefficient by the relation: 


1 


Q [9.33] 
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We note that when é tends towards zero, Q, tends towards infinity. The 
resonance angular frequency is given by the relation: 


©, =W)V1-€7 [9.34] 


For a conveniently damped system, Q, is less than 3 dB or a damping 
coefficient lower than 0.7. We can verify, with the help of a Black-Nichols diagram, 
that in order to have Q, <3dB, we need a phase margin Ag= 45° and a gain 
margin AG > 5dB. 


0 dB 





-180° -90° 


Figure 9.15. Black-Nichols plane 


d) Degree of stability of any order system 


When a system has any order transfer function, there will not be an explicit 
damping coefficient like in the case of second order system. However, the responses, 
with an equal resonance coefficient, have almost the same form. Hence, we can 
measure the degree of stability of any order system by its resonance factor in the 
Black-Nichols plane. 


9.3. Precision 


In section 9.1.1 we saw the general block diagram of a closed loop control 
system (Figure 9.2). The role of such a system is to follow the output s(f) with the 
input e(f). Signal e(t) injected at the input of the system is ideally the signal that we 
would want to obtain at the output. In the case of a perfect feedback control, the 
difference £(t)=e(f)-r(t) is then zero at every instant. In the case of a real 
feedback control, this difference is never zero due to the time constants of physical 
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systems, on the one hand and to the interferences acting on the system on the other 
hand. 


The quality of the feedback control is mainly translated through its stability and 
the follow-up precision of the output on the input and consequently of the dynamics 
of this difference. The precision study makes sense only if the system is stable. 


In practice, it is interesting to know the permanent error, also called static, €, 
which is the asymptotic value of the instantaneous error €, , called dynamic. 


The specifications on the error are most often formulated in one of the following 
forms: 


— bounded or zero static error in response to the inputs or canonical interferences 
(step function, ramp, harmonic signal); 


—bounded dynamic error €, <€ for the inputs having certain given 


characteristics. 


max 


In the presence of an interference, due to the overlapping principle, we have: 


E(p) = €. (p) + € (p) [9.35] 

where: 
1 
E€. (p}= ———~ E [9.36] 
2 1+ fh (2 

and: 

g, (p= a ap) [9.37] 

+442 B 


These expressions are calculated by considering firstly the zero interferences 
b(t)=0 (e(t) #0; see Figure 9.16a) and then the zero input e(t) =0 (b(t) #0, see 
Figure 9.1 6b). 
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e(t) + ED ED 








QS) 











a) b) 


Figure 9.16. Zero interferences b(t) = 0, e(t) #0 (a), zero input e(t) =0, b(t) #0 (b) 


9.3.1. Permanent error 
9.3.1.1. Step function input — zero interference 


When the input signal is a step function E(p) of amplitude Ey, E(p)=E,/p. 
Equation [9.36] then becomes: 


Eo 
€E (p) =—— [9.38] 
p(l+ ump) 
The theorem of the final value makes it possible to write: 
; | ; Eo 
lim €, (t) = lim p €, (p)= lim ——— [9.39] 
1e p—Ù p-0 (1+ 4143 p) 


This error is zero if 1+ 4442 has at least one pole at the origin, i.e. if 44 42 B 
has at least one integration. 


In general, lim ((p) is finite, hence the requirement for 4,4, to have one or 
p—0 


more integrations. 
Let us note by k the number of pure integrators; we say that the system is of class 
k. A system of class at least equal to 1 in open loop has a zero static error in 


response to an input step function. 


If the system in open loop is of class 0, the static error is equal to: 


Eo 
1+K 





Re DE [9.40] 
to 


Analysis by Classic Scalar Approach 273 


We notice that the error is inversely proportional to the static gain. 


9.3.1.2. Unit-step interference — zero input 


When the interference signal is a step function B(p) of amplitude By, 
B(p) = Bo / p . Equation [9.37] thus becomes: 


-B 
ea OE [9.41] 
P(I+ uap) 
The theorem of the final value makes it possible to write: 
LAA 
lif n Ep (t) = as p Elp) = Bo lim -—— [9.42] 
p-0 (1+ uiua 8) 
the following propositions will be verified: 
lim fy f =0 
p-0 

lim £ (t) =0 = § or [9.43] 
100 


4 (p) = * when p—0 (k21) 
pk 


In general, 4, and £ are not differentiators. The first condition is thus rarely 
satisfied. 


A system having in open loop at least one integration upstream from the 
application point of the interference (hence in u) presents a zero static error in 
response to an interference step function. However, if 4, does not have any 
integration, it is enough that 44; is an integrator system to cancel the static errors €, 
and €, at the same time. If mw, has an integration, we can introduce a 
supplementary integration in 4, in order to cancel £,(). Then we will have a 
second class system in open loop. 
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9.3.1.3. Ramp input — zero interference 


Let us calculate the final value of the error for a ramp input E(p) =Vo / p? à 


1 
Ep) =——_——_ E(p) [9.44] 
1+ up 
we have: 
1 
lim €,(t) = lim p €) =Vo lim ————— [9.45] 
too © p—0 p—0 pÜi+ ump) 
: K i 

lim Ee(t)=0 Mtb = — when p— 0 with k22 [9.46] 
t>% p 


In general, 442 must have at least two integrations in order to cancel the 
permanent error in response to an input ramp. 


If the system has only one integration in direct chain, there is a finite final error: 
K ; Vi 
Lo ~— when p>0 => lime,(t)=—2 [9.47] 
p 1 K 


Figure 9.17 sums up the permanent errors for a step function input, (Figure 
9.17a), and for a ramp input (Figure 9.17c) with a zero interference in the cases 
when “4, =K, and uw, =K, near p=0. In the cases when 4 = K, and 
My =K,/p near p=0, Figure 9.17b shows the absence of static error but the 
presence of a trail error (Figure 9.17d) again with zero interference. We notice an 
error for a interference in step function when 4, = K; and “> =K, (Figure 9.18b) 
whereas there is absence of error as soon as 4, = K; / p (Figure 9.18b). 
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Class 0 Class 1 
M = Ki; m= K2; p>0 


K2 
M1 = Ky mx PO 




















s(t) A 
Eo 
b) ne 
s(t) À s(t) À 
Ep? (ce) 
c) Ba 

















Figure 9.17. The image of responses to a step function 
input and a ramp input (zero interference) 
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Figure 9.18. The image of responses to an interference step function (zero input) 
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9.3.1.4. Sinusoidal input — zero interference 


Let us apply a sinusoidal input to the system: 
e(t) = Eg sin@ot [9.48] 


For a permanent state, €,is a harmonic signal of module le.| such that: 














1 
El = Ey ——— [9.49] 
1+ HMB], iy, 
If © is in the bandwidth of the open loop, |44 49 Pl, >> 1 we obtain the 
approximation: i 
1 
lE.| = Eo [9.50] 
MiP! p- jo, 








The error amplitude is inversely proportional to the gain of the open loop at 
angular frequency wọ. 


| uu2BGo) | 
20 log(1/a) -----> 
0 dB 








> Log © 


Figure 9.19. Gain of the open loop 


In order to obtain a given a copy precision (for example, 1% to ©, =Ird/s ), it 
is necessary that the gain of the open loop at angular frequency ©, is more than or 
equal tol/a. 
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9.3.2. Transitional error 


9.3.2.1. Case of a unitary feedback — zero interference 
e(t) 
s(t) 


Ed 
Figure 9.20. Unitary feedback 


We seek to limit €, to a value £4 max - 
For any input signal, the expression of the dynamic error is complex. However, 
if there is a limitation of the input signals at speed v and for acceleration y , it is 


possible to obtain a condition characterizing the precision required. 


The input signal is defined by the constraints: 
v<vy and Y<Yy [9.51] 


Among all the signals satisfying these conditions, the sine wave signal is by far 
the simplest. We will study this by replacing the signal by this “equivalent” sine 
wave signal. 


Let e(t)=£ 9 sin@ot be this signal. Its maximum speed and acceleration are 
given by: 


vm = Eo®o and yy = Eowë [9.52] 


These two equations define a sine wave signal which must be copied with an 
error less than £4 max - From these equations, we obtain Ey and &,: 


vi YM 
Eo =—— and wo = — [9.53] 
YM YM 
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hence the condition: 





l < fd max [9.54] 
Ju Pr ym 
Vu 
YM 


We note that this condition is necessary and, in certain cases, it may not be 
sufficient. 


9.4. Parametric sensitivity 


Multiple factors, such as ageing or change of working points may lead to 
variations on the representative model parameters of the systems. It can be 
interesting to examine the influence of the variation of these parameters on the 
overall behavior of the system (stability or precision, for example). 


A system is called sensitive to the variations of a parameter œ if its behavior is 
affected by these variations. If G(p) represents the transfer function of the system, 
the sensitivity with respect to parameter @ is defined as being the ratio: 


0G(p)/G 
= (p)/G(p) [9.55] 
dal a 
9.4.1. Open loop sensitivity 
Let the transfer function of an open loop system be: 
G(p) = Me) [9.56] 


~ D(p, a) 


Sensitivity is thus written: 


È (xo 1 eD(p) aC [9.57] 
oa N oa D 
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: K 
EXAMPLE 9.5.— let us consider a first order system: Ts where K =QK . The 





sensitivity around gain Kọ is: +4Pp 
K 
py as Laiya [9.58] 
4 Ko 1+1p 


Sensitivity is 1 and any gain variation will have consequences on the output. 


Let us suppose now that T = aT, and therefore the sensitivity around the time 
constant Tọ is: 


T, 
EE Oe [9.59] 
n |K 1+aTop 
oT, 
Sy eee [9.60] 
To 1+aTop 


We notice that in this case sensitivity is a function of frequency. 


The responses to a step function for K =5 and K =15 to the nominal response 
corresponding to Ko =10 (Figure 9.21a) show that the system is integrally affected 
during the gain variations. 





15, 

















Figure 9.21. Responses to a step function for k = 5 and k = 15 (a), for T = 0.5 and T = 2 (b) 


Similarly, we can visualize the responses to a step function for T = 0.5 and 
T =2 for comparison to the nominal response corresponding to 7, =1 (Figure 
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9.21b). The sensitivity tends towards zero when p— 0 depending on equation 
[9.60] and hence in permanent state the responses tend towards the value of the 


static gain K. 


9.4.2. Closed loop sensitivity 


Let the looped system with unitary feedback be: 


—=@—[6@) } + 








Figure 9.22. Looped system 


The closed loop transfer function is: 





_ Gp) 
BE 1+G(p) 


The sensitivity for a low variation of G(p) is obtained by: 











Sopp = OG RFLP) a 
BF 0a GBF 
or: 
COG p))- Gp) SO 
S = 1+G 
BF (sam) Gp) | +G(p)) 
SRE dG(p) a 
BE Ja G(p\l+G(p)) 
Sopr = l 


Sg 7— n 
(1+G(p)) 


[9.61] 


[9.62] 


[9.63] 


[9.64] 


[9.65] 


Analysis by Classic Scalar Approach 281 


This equality shows that the bigger 1+ G(p) is, the smaller the sensitivity of the 
looped system becomes. 


EXAMPLE 9.6.— let us return to the system G(p) = eee previously considered 


1+7p 
and relooped with a unitary gain (see Figure 9.23). 
e + K S 
aowa > 


(1 + Tp) 








Figure 9.23. Relooped system with unitary gain 


Let us take K = aK, . The closed loop transfer function is: 


KG 





G Ss se Ve 9.66 
BF (P) ak o + Ip [9.66] 
which has the form: 
K' 
Gpr(p)= [9.67] 
1+T p 








' OK ' T 
where K = 0 and T = ; 
OK 0 1+0K 0 
We imagine that the sensitivities of the closed loop diagram are less than those in 
open loop. This is verified on the behavior of the unit-step response below. In fact, 
the sensitivity around gain Kọ , with respect to the parameter « is: 


1+7 
E (1+ Tp) 


2 [9.68] 
1+&Ko +Tp 
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Let us suppose now that T = aTQ and therefore the sensitivity around the time 
constant Tọ is: 


aT, 
Cees [9.69] 
m 1+aTop+K 


Figure 9.24 shows the responses to a step function for K =5 and K =15 in 
comparison to the nominal response corresponding to Ky =10. Figure 9.24b shows 
those of the looped system for T = 0.5 and T=2 in comparison to the nominal 
response corresponding to 7, =1. 


15 


10 




















Figure 9.24. Responses to a step function 
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Chapter 10 


Synthesis of Closed Loop Control Systems 


10.1. Role of correctors: precision-stability dilemma 


The correction methods covered in this chapter refer to considerations of scalar 
behavior. It is fundamental to understand that the specifications stipulating the 
closed loop performances will be translated by the constraints on the frequency 
response of the open loop corrected system. 


The search for a compromise between stability and rapidity generally leads to 
imposing, on the closed loop, a behavior similar to that of a second order system 
having conjugated complex number poles. The choice of the damping value € is 
imposed by the required degree of stability. That is why it is indispensable to have a 
good knowledge of the relations between the parameters and the behavior of the 
second order systems in order to be able to use specifications defining the 
performances required from the closed loop final system. The general principle of a 
specification list is based on two points: 


— interpretation of specifications in order to obtain the characteristics of a second 
order model of the closed loop needed; 


— search for the constraints on the open loop introducing the behavior sought in 
closed loop. 


For reasons of clarity, let us recall the main results of the previous chapter 
concerning the analysis of the behavior of systems. 


Chapter written by Houria SIGUERDIDJANE and Martial DEMERLE. 
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10.1.1. Analysis of systems’ behavior 


10.1.1.1. Static errors 
For a zero static error: 


— with respect to a set point step function, there has to be at least one integration 
in the open loop; 


— with respect to a set point ramp, there have to be at least two integrations in the 
open loop; 


—with respect to an interference step function, there has to be one integration 
upstream from the input point of the interference in the open loop. 


10.1.1.2. Stability 


The analysis of stability of the looped system, in the current case of a stable 
system in open loop, is based on the simplified Nyquist theorem: the image of the 
contour by up must not surround the point —1. The distance with respect to point 
—1, which is expressed in terms of phase margin and gain margin provides a 
“measurement” of stability. 


We can associate a “visual” criterion of the stability measurement with the help 
of the closed loop response overflow with a set point step function. For a system 
whose closed loop transfer function is of second order, we know how to connect the 
concept of unit-step response overflow to the damping coefficient € of the poles and 
consequently to the phase margin of the open loop (OL). We note that for a second 
order system, a sufficient phase margin implies a good gain margin and the only 
concept of phase margin is thus sufficient. 


100 


D% 
un 
S 





Figure 10.1. Overflow curve based on damping 
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Another way to characterize a stability measurement consists of analyzing the 
frequency response of the closed loop. The instability is characterized by the 
presence of a resonance peak. 


For a system whose closed loop transfer function is of second order, we connect 
this concept of frequency response resonance of the closed loop to the damping 
coefficient é of the poles and hence to the phase margin of the open loop. 

















la A® in degrees Resonance in dB 
0.1 12 12 

0.2 22 8 

0.4 43 2.7 

0.6 58 0.35 

















Table 10.1. Damping, phase and resonance margin 


There is no more resonance from & = 0.7. 


10.1.1.3. Rapidity 


The character of rapidity can be perceived in two ways, either directly by 
observing a time or frequency response, or indirectly through the concept of 
dynamic error (a rapid system enables the pursuit of an input that rapidly varies and 
hence a low dynamic error; see the method of the equivalent sine curve described in 
the previous chapter). 


Closed loop frequency behavior, bandwidth 
To evaluate the rapidity of the looped system, a sinusoidal input is used whose 
frequency can be chosen. More often than not, the transfer function of feedback 8 
is a constant and we can then consider two cases for the behavior in closed loop 
u 
1+uB 





If |u| >>1, i.e. if we are in the bandwidth in open loop, we can consider the 
closed loop behavior equivalent to 1/  , presenting a constant gain irrespective of 
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the frequency. For any frequency sinusoidal input in the bandwidth, we can then 
consider that the output is a faithful image of the input. 


If |u| <<1, i.e. we are outside the bandwidth in open loop, we can consider the 
behavior in closed loop equivalent to  . In this case, the amplitude of the output is 
very low compared to the amplitude of the input. 


A simple evaluation of rapidity consists of considering the bandwidth w, in 
closed loop. With a constant feedback £ , we can then consider the closed loop gain 
as constant if up| >> 1. Hence, we can associate the bandwidth in closed loop with 
the bandwidth in open loop. The bandwidth in open loop is a qualifier of the rapidity 
in closed loop. 


Time behavior in unit-step response 


We can also qualify the rapidity of a closed loop system by observing its 
response to a set point step function. For a system without overflow, we can measure 
the establishing time, but the dilemma precision-stability often leads to a 
compromise that entails a response with overflow. For a system having such a 
response, we use as a rapidity criterion the time necessary to reach the first 
maximum noted by t, . 


For a second order system we can connect the time notion of first maximum to 


the concept of bandwidth in open loop tp =7/&,.11-€ 2 We can use in general 


the relation approached: @,t,, ~3 which is valid for any correct damping 
(0.6<£<1) 


Due to this relation, we can translate a first maximum time constraint of a unit- 
step response in closed loop into a bandwidth constraint in open loop. 


Dynamic precision 


In order to calculate an upper bound of the instantaneous error following the 
variations of any set point, we use the method of the equivalent sine curve. The 
result of this analysis leads to a specification of minimum gain in open loop for a 
certain bandwidth 0 to &,. Therefore, this constraint leads in general to imposing a 
certain bandwidth to the open loop, even if the constraint in the gain refers only to 
the bandwidth 0 to wọ. 
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In short, we will use the following rules: 
— in order to obtain a good precision, we need to: 


- in static state, have one or more integrations in direct chain in order to cancel 
the permanent state errors, 


- in dynamic state, have a high gain for a frequency band until ©, . This makes 
it possible to limit the dynamic errors; 


— for a good degree of stability, it is necessary that the phase margins and (or) the 
gain margins defined near w, are satisfactory; 


— for good rapidity, it is necessary that the bandwidth in open loop is large and 
therefore generally having a high gain. 


If we use the Bode diagram to represent the scalar characteristics of the open 
loop to be corrected, we could translate the above constraints depending on three 
frequency areas as indicated in Figure 10.2. 


The previous chart underlines the contradictions between the specifications 
concerning precision and stability. An increase in the gain favors the precision at the 
expense of stability. 


2 i Dynamic errors 


Rapidity 








> 
Wy Oe o! © 


Ÿ ay 
A6 Stability 


Figure 10.2. Bode diagram of an open loop to be corrected 


The correctors or regulators have the goal to provide a control signal u to the 
process in order to maintain the requirements of precision and stability. They are 
inserted into a looped system as represented in Figure 10.3. 
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b(t) 














Correctors 


Intermediary 
control 


[B | 


Figure 10.3. Corrector in a looped system 





Pad 








The control magnitude can be a function of €,x,b or e. Based on the signals 
considered and the type of the function created, there are several types of correctors. 


Firstly, we will review the topologies of the most widely used correctors. 


10.1.2. Serial correction 


e(t) + Q e(t) u(t) page BE s(t) 















Figure 10.4. Serial corrector 


This type of corrector is inserted in the direct chain in a serial connection with 
the process and provides a control signal: 


u(t) = f(e) 


The control takes into account only the error signal. 


Among the usable functions f, we can find: 


— the proportional action noted by P: 


u(t) = ke(t) or U(p) = ke(p) 
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— the integral action noted by /: 


t 
wie | e(O)dt or U(p) = 
10 T; 


an 


— the derived action noted by D: 


de(t) 


u(t) = Ty —— di 


or U(p)=Ty pE(p) 


— the phase lead action: 


1+7p 


HUE Far 


e(p) with a<1 


— the phase delay action: 


1+7p 


CE Fra 


e(p) with a>1 


A serial corrector creates the combinations of these actions more or less 
perfectly. 


It should be noted that this corrector acts on the static precision, dynamic 
precision and stability. 

For a first approximation: 

— action 7 cancels the static error; 

— action P increases the dynamic precision; 


— action D or the phase lead tend to stabilize the system. 


10.1.3. Parallel correction 


This type of corrector is grafted in parallel on an element of the direct chain as 
shown in Figure 10.5. 
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e(t) + e(t) ©) u(t) LA SD 


v 
Le] 


Figure 10.5. Parallel corrector 

















u can possibly be a serial corrector. x is an intermediary magnitude between 


sO 
t 


the control and the output. If v contains a term of the form , we say that we 


realized a tachymetric correction. 


It should be noted that this corrector acts essentially on the dynamic stability and 
precision and not on the static error (this corrector cannot introduce integration). 


10.1.4. Correction by anticipation 

These correction techniques are used only as a complement of looped correction 
techniques. They rely on the injection of signals in open loop in order to minimize 
the transitions felt by the main correction loop, following the input variation or an 


external interference. 


10.1.4.1. Compensation of interferences (zero input) 














e(t) + elt) 








[B 


Figure 10.6. Corrector by anticipation 
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Let us suppose that the interference is measurable. It is then possible — at least 
theoretically — to eliminate the influence of the interference b(f) with the help of a 
corrector by anticipation C(p) described by the diagram in Figure 10.6. 


In fact, it is enough to choose: 


1 
C = — 
MS 


to cancel the effect of b(f) on the output. 


We note in this case that the stability is not affected, nor is the precision with 
respect to the input. 


Most often, AF is not physically feasible, which leads to adopting 
MatP 





approximate forms of . There is no perfect compensation of the transient 


Uat P 
state of interferences. 


EXAMPÉE 0 I if mo Né cod ke CG) == — wate eS 
P K (l+) 


than the main time constants of 43. 


10.1.4.2. Compensation of the input (zero interference) 








u(t) s(t) i 











[B 


Figure 10.7. Input compensation 


This diagram can be considered, with respect to the error, as the overlapping of 
the two diagrams of Figure 10.8. 


292 Analysis and Control of Linear Systems 


e(t) + e(t) A 








e(t) [co] + -Wb E2(t) 








Figure 10.8. Diagram decomposition of the input compensation 


We have: 


E(p) _, E(p)C(p)(-#26) 
I+ My B 1+4 p 





E=E tE = 


In order to cancel the error with respect to the input, it is enough to choose 


1 
C(p)=——. 
e Up 


It should be noted that in this case the closed loop control system perfectly 
follows the input law, without introducing the integration in the direct chain. The 
stability of the system is not modified, nor is the influence of interferences with 


Ps & : f 
respect to the error. Most often, a is not physically feasible. Thus, the 
M2 


compensation will not be perfect in transient states. 
10.1.5. Conclusions 
The feedforward correctors, when they are feasible, do not modify the stability 


of the loop and compensate either the error due to the input, or the effect of 
interference. In a complex case (several interferences, some of which are non- 
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measurable), the implementation of these compensators can be difficult and 
imperfect. 


The “parallel” correctors generally modify an element of the direct chain, 
without introducing integration. They can be used in order to improve the stability. 


The “serial” correctors modify the stability and precision with respect to the 
input and interferences. This mode of correction 1s the most widely used. 
10.2. Serial correction 
10.2.1. Correction by phase lead 


10.2.1.1. Transfer function 


Generally speaking, we call a phase lead corrector a corrector whose transfer 
function has the form: 





1 
D Re 
1+aTp 


This corrector can be physically created by the circuit of Figure 10.9. 


C 
| 


e(t) Wy UD 














Figure 10.9. Phase lead network 


R 
with T=R,C and a=—2—. 
R +R) 
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It should be noted that the resistive bridge alone brings about an attenuation to 


: ae f . K 
the low angular frequencies, which justifies the presence of an amplifier of gain —. 
a 


The Bode graph is represented in Figure 10.10. 


| CG) | À 
(0)20 log K/a 





20 log K ! | 


1/T L/TVa  1/aT Log © 








D 
VT  1/TVa  1/aT Log © 





Figure 10.10. Bode graph of a phase lead network 


The tabulated values of the maximum phase input @,, (obtained for 


o=1/TVa ) are given for your information (Table 10.2), in practice we can use the 





. 1-a . . . 
formula sin @,, = ; “ in order to find the value of a corresponding to a desired #,, 
+a 


value. 





a 1/4 1/6 1/8 1/10 1/12 





m 37° 45° 51° 55° 58° 


























Table 10.2. Maximal phase lead 
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10.2.1.2. Action mechanism of a phase lead corrector 
There is no general method to configure such a corrector. 
EXAMPLE 10.2. 


Let us examine on a current case the problems linked to this configuration (see 
Figure 10.11). 








e(t) + K, s(t) 
PO + tp) me 








Figure 10.11. Uncorrected system 


The OL of the uncorrected system presents an integration. An integral action is 
hence useless in order to ensure a zero static error for a step function input. 


Let us suppose that we want a response time f¢, imposed in closed loop, or a 


: pa ; 1 
dynamic precision given to an angular frequency Wo <<—. 
T 


Thus, we have to introduce a minimum gain K (proportional action) in the chain 
that leads to an insufficient phase margin: 











KK KK 
HB= ES with K such as RES = Gys 
p(lt+ p) PUP) = jen, 
KK, ; 1 0 P : 
or = Gm if Op <<—. Gm is minimum gain required at the angular frequency 
Mo T 
© 


We notice on the graph in Figure 10.12 that the phase margin becomes 
insufficient. A correction is thus necessary, for example, a phase lead correction. 
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Figure 10.12. Bode graph with insufficient phase margin 
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Figure 10.13. Bode graph in OL of the corrected system 
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Since this corrector is in series with 4f , there is an addition of gain and phase 
curves in the Bode plane (Figures 10.10 and 10.12). 


NOTE 10.1.— it should be noted that the open loop system without correction does 
not shift phases beyond —180°. Consequently, a phase lead corrector chosen for 
a = 1/6 —if it is conveniently placed — can bring a 45° phase lead, which is sufficient 
to ensure a convenient stability degree for the system. 


For that, we cut the axis 0 dB at the angular frequency w', =1/T Va where the 
phase lead of the corrector is maximal (environment in Bode representation of the 
segment i T,l/aT J). Parameter a being chosen as well as the relative position of 
wo", we verify if the value of minimum gain requested is satisfied and if not we fail 
the position of @, . 


The resulting phase margin is calculated by adding the residual phase of the 
uncorrected system to @e with respect to —180° and the @,, of the corrector. It 
should be noted in this example that the resulting phase with respect to —180° is 
always positive. The looped system will thus always be stable. 


EXAMPLE 10.3.— it is not always like this, in particular when the OL transfer 
function of the uncorrected system shifts phases more than — 180°. 


: K : 
Let the OL system before correction be: with 7) >T>. 





pU+7,p)1+ Tp) 


0 dB 


-90° 





-180° 5 





-270° D 
hry, 1/T Log (63) 





Figure 10.14. OL system before correction 
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The phase margin of the corrected system is obtained by adding to the Ø, of the 
corrector 180° plus the phase of the initial system to w',.: 


Ad =O) + Gn +180° 


We note that this last term can be negative; hence we will need to choose a more 
efficient corrector (smaller a). 


When the phase curve of the system before correction decreases too fast, the 


phase lead caused by the corrector risks being considerably contrasted at point a, . 
The correction by phase lead can thus prove to be insufficient. 


0 dB 


-90° … 
-180° 


-270° 











Figure 10.15. Correction by phase lead 


In conclusion, the phase lead corrector: 
— increases the bandwidth (a, >We); 


; : ' : l-a , ; 
— brings in @, a phase lead ¢,, defined by sin @,, = Gat if the corrector is 
+a 
properly centered so that its central frequency coincides with the cutting frequency 
of the corrected system. This choice of w. avoids the complete construction of the 
phase in order to determine the phase margin obtained. 
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When the correction is perfectly set, in order to determine the final performances 
of the closed loop corrected system, a complete construction of the module and 
phase is necessary in the Bode plane. Therefore, a graph in Black-Nichols graph will 
make it possible to obtain the closed loop transfer function. 


10.2.2. Correction by phase delay 


10.2.2.1. Transfer function 


Generally speaking, we call phase delay corrector a corrector whose transfer 
function has the form: 


a kasi 
1+aTp 





C(p)=K 


This corrector can be physically created by the circuit represented in Figure 
10.16. 





e(t) LR À > uo 
Ry ml 


R2 


Figure 10.16. Phase delay network 


Ry +R) 
x g 


With T = RC and a= 


The phase delay corrector has the Bode diagram represented in Figure 10.17. 
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Figure 10.17. Bode diagram of a phase delay network 








l/a 1/4 1/6 1/8 1/10 1/12 





Om 31 —45° —51° 55° —58° 


























Table 10.3. Maximal phase delay 


10.2.2.2. Action mechanism of these correctors 


The effect of these correctors is to increase the gain at low angular frequencies 
and thus improve the static precision without modifying the behavior at high angular 
frequencies. 


In addition, we can use the phase delay corrector to make the gain fall at high 
angular frequencies without modifying it at low angular frequencies. 


With the help of an example, let us see the effect of this corrector. 


e(t) + e(t) u(t) K, s(t) 
pCI + t1p)(1 + mp) | g 


Ty > T2 











Figure 10.18. Phase delay corrector 
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The desired performances are: 


— zero static error; 
| RAR 1 
— dynamic precision imposed at Wp <<—; 
T 
— stability degree imposed by a phase margin A@ from 45 to 50°. 


The static error with respect to the input is zero (presence of one integration in 
the direct chain) in the absence of interferences. The graph of the OL which makes 
it possible to define the minimal proportional action K, ensuring the desired 
precision is given in Figure 10.19. 


A phase lead correction is impossible for such a system because the phase of the 
OL decreases too fast near we. Since the integral action is useless, we can then place 
a phase delay corrector. 














Figure 10.19. OL graph 


This corrector must not affect the gain on the side of@g, hence the choice 


1 | F : Pe f 
T = Op. The correction will be even more efficient for the stability as the gain 
a 
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curve will cut the axis 0 dB with the biggest slope segment possible (—1) and near its 
environment. 


1+7p 


Due to these considerations, we adopt as corrector C(p)=K E with 
+aTp 


1/aT near Wp. Parameter a will be determined by successive tests. The phase 
margin after correction can be estimated as follows: 

— calculate the ratio 7} /T characterizing the “length” of the slope segment (—1) 
defined above; 


— if the new gap angular frequency O., chosen near the geometrical 


environment of i T, 1/ z] , is much higher than @ọ (@, >10@,) and much lower 


Ta e 107; 


carrying over the ratio 7,/Tin Table 10.2 for the phase lead corrector. After 


' 1 in he Micra 
than eo Lo < al , then an approximation of the phase margin is given by 


correction, we obtain the Bode diagram in Figure 10.20. 














Figure 10.20. Bode graph in OL of the corrected system 


It should be noted that near the angular frequency w. , the contribution of the 
two breaks, (—1) to (-2) in @p and (—2) to (3) in 1/7; , is negligible. Everything 
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takes place as if only the slopes (—2), (—1) then (—2) remained with respect to the 
phase. This graph is that of a double integrator in a serial connection with a phase 
lead corrector. 


It should be noted that the phase delay correction happened in this particular 
example because the ratio between 1/7, and Wg was high, but this is not always 


the case (see the combined action correction). 


The phase delay corrector decreases the bandwidth (a, <q, ) and the constant 
stability static error (phase delay for non-integrator systems). 


There again, once the correction is set, it is necessary to build with precision the 
total OL in order to deduce the performances in CL. 
10.3. Correction by combined actions 
10.3.1. Transfer function 


Generally speaking, we call a lead-delay combined action corrector a corrector 
whose transfer function has the form: 


(1+ 7p) (1+73p) 


C =K 
(P= Br) Tp) 





with Ti >T > T3 > T4 


This corrector is obtained by serially connecting a phase delay corrector 


1 Cr) and a phase lead corrector K, (+73p) 


(1+7,p) (1+74p) 


The Bode diagram of such a corrector is represented in Figure 10.21. 


with KjK, =K. 
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| CGo) | 





20 log K 





Figure 10.21. Bode diagram of a lead-delay network 


10.3.1.1. Action mechanism of these correctors 


These correctors combine the actions previously studied. They are used when the 
simple action correctors do not lead to the performances desired. 


EXAMPLE 10.4. 











e(t) + e(t) u(t) Ky s(t) 
pd + 4p) + tp) = 








Figure 10.22. Combined action corrector 


In Figure 10.22, we have 7; > 72 and as desired performances: 


— zero Static error; 


: ere 1 
— dynamic precision imposed at Wg (close to — ); 
Ti 


— phase margin A@ from 45 to 50°. 
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0 dB 


90° 


-90° 


-180° 


-270° 





Figure 10.23. Phase delay correction 


The phase delay correction is inefficient here (proximity of angular frequencies 
1 : : ; ma 
@g and —) and therefore in order to obtain the desired stability, a phase lead term 
Ti 
must be added. 


The corrected OL is represented in Figure 10.24. 
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Figure 10.24. Bode graph in OL of the corrected system 


We can estimate the phase margin of the corrected system, if w. is close to the 
environment of slope segment (—1), by referring to Table 10.2 of the phase lead 
circuit. This would be even more justified in the present case since @, >10@ 9 and 
©, <1/1074. 

10.4. Proportional derivative (PD) correction 


10.4.1. Transfer function 


A proportional and derivative action corrector has as transfer function: 
C(p)= K(+Tap) 


The derivative action is not physically feasible and it is approximated by a 


T, 
transfer function of the form: T with T << T4. 
+p 
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The new corrector has then as transfer function: 


T 14+7 
C(p) = ar z Kr) 
l+% l+ 


Therefore, the PD corrector has a transfer function which is equivalent to that of 


a phase lead corrector. The same methods could be used in order to determine K, T4 
andT . 


10.5. Proportional integral (PI) correction 
10.5.1. Transfer function 


A proportional and integral action corrector has as transfer function: 


2 1 )_ {1+7;p 
Cw) aug) « z | 


The proportional integral regulator (PI) has as its Bode diagram that in Figure 
10.25. 











| CGe) | À 
(-1) 
(0) 20 log K 
! = 
1/T; Log œ 
7 | 
s l je 
Log œ 
90° 





Figure 10.25. Bode diagram of a PI corrector 
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The essential function of a PI regulator is to bring one integration in the open 
loop. It is indispensable if the specifications stipulate that a zero static error is 
needed in response to a step function when the process to control does not have one. 
It is also indispensable if the process is an integrator but it is subjected to an 
interference injected upstream from the process. If we desire a zero static error for a 
step function interference, it is essential to add an integrator at the level of C(p). 


The following example (Figure 10.26) is an illustration of the correction of a 
non-integrator process. 





s(t) R 











Figure 10.26. Correction of a non-integrator process 


The desired performances are: 
— zero static error; 


— imposed bandwidth {0,&,.} ; 


— stability degree imposed by a phase margin A@ from 45 to 50°. 


An integral action is necessary in order to cancel the static error. However, it 
cannot be alone because then the system could not be stable. 


The graph of the Bode diagram of the open loop without correction makes it 
possible to place a PI corrector (see Figure 10.27). 
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Figure 10.27. Bode diagram of the uncorrected system in OL 


Let us introduce a PI corrector (represented by a dotted line in Figure 10.27) so 
that angular frequency 1/7; is near œ, /10 in order not to modify the phase margin 
A@. The corrected curve in open loop is represented in Figure 10.28. 








Figure 10.28. Bode diagram of the corrected system in OL 


This corrector provides the desired performances to the closed loop. 
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10.6. Proportional integral proportional (PID) correction 


10.6.1. Transfer function 
An ideal PID corrector has as transfer function: 


c= x israp] 
T;p 


1 


or: 





2 
cu METTRE | 


Tip 


1 


The roots of the numerator are: 


T, 4T T; AT 
RENNES de Me [1-4 
2 T, 2 T; 


: 1 
with T? —4T;T4 20 the zeros are real and placed in = and N 
1 2 


| CGæ)| 








Figure 10.29. Bode diagram of a PID corrector 
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TT: 
We note that T; = T) +T, and Ty rie 
T, +T> 
1 . : 1 1 1 1 1 
If the break — is much higher than — we have =— and = ; 
T, Ti T T; T Ta 
Tap 


In practice, the derivative action will always be approximated by i : 
+p 


EXAMPLE 10.5.— integrator system affected by a disturbance (for example, level 
regulation with flow disturbance) with 7; > 7). 


e(t) + e(t) u(t) K, s(t) 
PCI + tp)(1 + tp) = 


Figure 10.30. /ntegrator system affected by a disturbance 

















Desired performances: 
— zero static error with respect to the set point and disturbance; 
— phase margin Ag from 45 to 50°; 


— dynamic precision imposed at angular frequency &o. 
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PI + derivative approach 








Figure 10.31. Uncorrected system 


In order to cancel the static error with respect to the disturbance, we will have to 
introduce an integral action in the corrector C(p). The proximity of angular 


5 1 1 à 

frequencies @g, — and — forbids us to use a PI corrector. Let us try a PID 
Ti T2 

corrector. 


The double integration leads to a slope graph (—2) at low angular frequencies that 
has to go through point A (minimum gain at œg). On the other hand, the phase 
margin imposes to cut the axis 0 dB with a slope segment (—1) on a convenient 
length and follow by a slope (—2), from which we obtain the chosen corrector. 
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0 dB 





-180° 
-270° 





Figure 10.32. Bode graph of the corrected system 


The same observations as above on the estimation of the phase margin apply 
here. 


10.6.2. Experimental adjustment method 


We will adjust a PID in order to control a system whose transfer function is 
unknown. 


e(t) + e(t) u(t) s(t) 











Figure 10.33. System with unknown transfer function 
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In certain cases, the transfer function of a system is unknown; hence the 
adjustment of a PID cannot use the methods previously presented. Thus, there are 
experimental adjustment methods. We shall keep in mind the Ziegler-Nichols 
method (sustained oscillations). This method is suitable only for a CL study. The 
adjustment is done as follows: 


— neutralize the integral actions (T; =) and derivative (7, = 0) and therefore 
there remains a simple loop with proportional action; 


— increase gain K of the proportional action until the system becomes oscillating; 


— after obtaining a balance of oscillations, increase period Tọ (expressed in 
seconds) of these sustained oscillations and the limit gain K jy . 


The corrector can be adjusted with: 


— K =0.6K,,, in order to have sufficient stability margin; 


aes i T ‘ 
— the derivation constant is chosen at Ty = a whose effect is to accelerate the 


proportional action; 


; : To . : 
— the integration constant is defined by T; =e in order to cancel the static 


error. 


It should be noted that here T; = 477, 1.e.: 


T: 


1 


2 
1+2T 
ces rap) EPL 


1 


NOTES ON THE SERIAL CORRECTION.— the methods previously described aim 
at quickly deciding if the given correction is adequate. On the other hand, it is 
obvious that there is only one corrector satisfying this problem. The choice will be 
made after the testing of several correctors and taking into account criteria such as: 
cost, technological simplicity, reliability. 


It should be equally noted that the correction does not make it possible to 
infinitely increase the precision and speed (increase obtained in general through the 
increase of the gain leading to strong saturation risks). In fact, when the system 
saturates, the linearly defined performances are not reached (increased response time 
and limited precision). On the other hand, irrespective of the linearity of the system, 
it is not always interesting to have a too extended bandwidth, due to the 
amplification of noises. 


Synthesis of Closed Loop Control Systems 315 


10.7. Parallel correction 
10.7.1. General principle 


This type of correction is represented by the diagram in Figure 10.34. 


e(t) + e(t) Q u(t) sO 


traj- 
Le] 


Figure 10.34. Parallel correction 

















C(p) must be calculated in such a way as to obtain the performances required. 
These performances (precision, stability) can be translated into characteristics on the 
OL graph in the Bode plane. The problem is thus to determine C(p) in order to 
obtain a corrected open loop of convenient shape. 


The corrected open loop has as a transfer function: 





Let us consider the two following cases: l4C(jo)| >> 1 and lu C(o)| <<1. 


Case 1: 


1 
Cjo) 


Ho 
1+42C(jo) 


|u.C(ja)|>>1 = 














the corrected open loop has as an approximate transfer function: 


A (puz (p) B 


Cp) (p) 


316 Analysis and Control of Linear Systems 


Case 2: 


U2 


nC jo) <1 > |—————— 
EE ER +400) 


= |u (jo) 








the corrected open loop has as an approximate transfer function: 
Hy (p)42(p)43(p)B(p) 


Let us trace the variations of these two transfer functions in the Bode plane. The 
curves are cut in points A and B as indicated in Figure 10.35 (one of these points can 
be infinitely rejected). 


| My Hops 7 | MC | max 





Figure 10.35. Variation of the two transfer functions 


In these points we have: 





[ou B| = fa 
i.e.: 
[22 | = k 
C 
or |uC|=1. 


Points A and B are thus the separations of cases | and 2. 
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Uitz Ê 
C 





Between À and B: |u Mr Ls B| > , Le. uC >1 (1* case) and outside A 








M38 
C 





and B: |u,u 13 B] < „i.e. [AC] <1 (2% case). 





An approximation of the corrected OL is represented with a line in Figure 10.35. 
This approximation may not be good near points À and B, since [uC ee is low 
(see Figure 10.35). 


10.7.2. Simple tachymetric correction (C(p) = Ap) 


We will show in an example how this type of correction is naturally introduced. 


e(t) + e(t) + u(t) K, 1 s(t) 
= “© 1+7p á 
C j — 


Figure 10.36. System to correct 

















The desired performances are: 


— zero theoretical static error; 
— minimal bandwidth w, = = a 
a 
— stability degree with a phase margin Ag = 45°. 


Without C(p) correction, it is necessary to choose a minimum gain K in order to 


maintain the constraint ©, > @ in: 


318 Analysis and Control of Linear Systems 






B 
og Oc = 8/t 


3 octaves 








Figure 10.37. Bode graph of the OL with 45° phase margin 


On the other hand, we have to make sure that the integration in the corrected OL 
is maintained (ensuring a zero static error). 


Let us choose C(p) so that 


PAP a f has the form represented in Figure 10.37. 


This graph leads to a 45° phase margin (the break is set at ©, = 2 ). 
T 


In the example of Figure 10.36, u, = K, U3 = 





K 
x maS pei and point 
(+p) P 


A is rejected at © =0(- in the Bode plane). 


Hence, we have: 


K 
Cp 


Utz Ê 
C 

















- : | (equation of the slope (—1) cutting the axis 0 dB in 
p 


aes 
T 
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By identifying: 
KT 
C(p) = g 


Hence, we see that the output C(p) is homogenous with the derivative of the 
output of the feedback control. We say we are dealing with a simple tachymetric 
correction (C(p)=4). 


In this precise case, a direct calculation by decrease of the secondary loop is 
preferred. Let us choose C(p) = À. 


The corrected OL has as a transfer function: 














K, K, 
K l+p K K, 7 1+4K, _ KK, 
Pig AK, p (1+ AK, +) HR 1+7 p 
1+% 1+AK,, 
K i T 


where K A = 





v T= 
1+2K, | 1+4K, ` 


In order to have the same graph of the new OL as before, we must choose 
parameters K and À in order to have 45° phase margin A6, such as: 














1 8 
Der 
T T ! 
Y=] = KK,=- V2 
KK, 8 
: 1 J2 
1 T 
PUHT P)| jo, 


64/2 


EE AE E ie TE eee 
TK K 
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or: 





7 TKt KT 
C(p)=—= = 
K, 6492 13 


10.7.3. Filtered tachymetric correction 


We will see in an example how this type of correction is naturally introduced. 


et) FeO t e LO 1 s(t) 
LE, - 


K, 
(1 + tp) + təp) 


Figure 10.38. System to correct 











The desired performances are: 


— zero theoretical static error; 
pae : 1 
—minimum gain G,, at @) = —; 
8T 
— 45 to 50° phase margin. 


Without correction, in order to have the desired gain at @) we have to choose 
K such that: 


KK, 
pA+7p)A+7p) 





SG 








Pp=j@ 


1 1 
where Wo << — <<—. 
Ti T2 


vad ; KK, 
Hence, it is sufficient to have > Gm- 
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Let us choose C(p) so that has the form represented in Figure 10.39. 





M138 
C 


This form was chosen in order to deform the OL according to the example in section 
10.3 (correction by combined actions). 


The corrected OL has the following form (we note that the quadrilateral ABXA 
is a parallelogram). 


0 dB 














Figure 10.39. Corrected OL 


In the example of Figure 10.38: 





K 1 
M =K, My = 2 » H3 = , B=1 
(1+ 7, p)(1+ 7p) P 
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and fa = i = ataei . Hence, the expression of the corrector: 
C Cp ATp 
AT T 
C(p)= = 1 
1+7p 1+7p 


The realization goes through the reinjection of the speed through a high-pass 
filter created by the circuit given in Figure 10.40. 





Figure 10.40. Speed reinjection circuit 


RCp 


The transfer function of this circuit is ; 
1+ RCp 





Such a correction is called filtered tachymetric correction. 


T is obtained by direct reading of the diagram and À is determined, for example, 








by writing that ASE has a gain of G,, in © = Wo (point A). 
K(1+7} 
(i+ 2e 
ATp Ls 
P=J@% 
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In general, © «= > Gy = Le À i 


" ATo? GT 
It should be noted that this correction was successful because the variance 
; 1 | , 
between points À (@ọ) and C (=) was sufficient (3 octaves). If points A and C 
T 


(hence B and D) are too close (2 octaves or less), such a correction cannot provide 
the sufficient phase margin. 

Here again there are no general criteria for the choice of the corrector. The 
parallel corrections prove to be particularly practical as soon as the measurement of 
the reinjection variables is available (speed, acceleration, etc.). 

Finally, we should remember that the asymptotic graphs are approximate and 
that it is good to verify the corrected diagrams by using, for example, the Black- 
Nichols graph. 


10.7.4. Correction of delay systems: Smith predictor 


Let us assume that the system in Figure 10.41 needs to be corrected. 


s(t). 








Figure 10.41. System to be corrected 


The negative effect of the delay on stability is obvious in the fast rotation of the 
phase. Since the principle of causality prevents the delay compensation by a time 
advance, the solution proposed by Smith consists of only rejecting the delay outside 
the loop. 


A possible structure of correction is represented in the diagram of Figure 10.42. 
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Oe Ae SOF upper? |} Om 

















Figure 10.42. Correction structure 


The corrected transfer function in closed loop is written: 











Cg? 
1+CC' 7 Cue"? 
ig eC T 
1+CC 


By choosing C ‘= uBü-e ?), which is physically feasible, the closed loop 


becomes: 


Cu oP 
1+ Cup 





An equivalent block diagram could be that in Figure 10.43. 


e(t) + e(t) SD 














Figure 10.43. Example of block diagram 


We notice that: 
— the delay is obtained from the loop; 
— its negative influence on stability is deleted; 


— however, it subsists between the input and the output. 
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A reliability study will be necessary in order to evaluate the incidence of a 
variance between the pure delay of the process and the one estimated introduced in 
the corrector C . 
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Chapter 11 


Robust Single-Variable Control 
through Pole Placement 


11.1. Introduction 


Partially originating from the adaptive control, RST control appeared in books 
around 1980 [AST 90, FARG 86, LAN 93]. Curiously, this approach was 
systematically described for the numerical control, perhaps because of its origins 
mentioned above and in [KUC 79]. In fact, this polynomial approach is the 
traditional correction with two degrees of freedom, a combination between feedback 
and feedforward on the setting. The primary goal of this chapter is to replace this 
order in a general context and to show all the degrees of freedom available to the 
designer. Then, a very simple, even intuitive, methodology is proposed in order to 
use these degrees of freedom to achieve a certain robustness of the structure created. 


11.1.1. Guiding principles and notations 


Figure 11.1 shows the block diagram of the RST control. Block diagram because 
transfers R, S and T are polynomials and are thus not proper. 


Chapter written by Gérard THOMAS. 
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d 
7 + 
rR] a 


Figure 11.1. Block diagram of RST control 








In all that follows, unless otherwise indicated, the systems studied will be 
discrete or continuous, i.e. will be respectively described by transfers of the z or s 
variable. For reasons of simplicity and coherence, the examples will be treated in the 
continuous case. 


As for any correction structure, the designer will have to determine the 
correction parameters (here polynomials R, S and T) to ensure: 


— internal stability [DOY 92]; 
— the asymptotic follow-up of a certain class of settings; 
— the asymptotic rejection of a certain class of interferences; 


—a Satisfactory transient state. 


However, respecting these specifications is not sufficient to ensure a satisfying 
operation of the installation; it will be necessary to take into account: 


— the saturations of the process; 
— the level of measurement noise; 


— modeling errors. 


The non-compliance with these simple rules has had negative impacts on 
automatic control, which is then considered as a highly theoretical discipline whose 
industrial applications seldom exceeded the performances obtained by PIDs. 


The guiding principle of the RST control is to calculate the polynomials R, S and 
T to obtain: 


C AS+BR A, 
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which will be satisfied if: 


(a) BI =BmAy 


[11.2] 
(b) AS +BR = An4o 


We observe that the unknown factors of the problem (R, S and 7) are the 
solutions of polynomial equations. In particular, the latter is well-known by 
algebraists as a Bezout equation or Diophantus problem. That is why the following 
section is dedicated to some reminders on polynomial algebra. It shall be noted that 
this formalism was extremely well emphasized in [KUC 79] within the multi- 
variable and discrete context and it is the starting point of the next section. 


11.1.2. Reminders on polynomial algebra 


A certain number of traditional results on polynomials is gathered here, in order 
to solve the general polynomial equation: 


AX +BY =C [11.3] 


We must point out that here we are interested in the single-variable case where 
A, B, X, Y, C are polynomials and not matrices of polynomials. Thus the 
multiplications can be written in a random order. 


THEOREM 11.1.— the set of the polynomials with an unknown quantity on a 
commutative body is a commutative unitary ring. 


The ring of polynomials on R will be called Rix]. We note by: 
— 1 the identity polynomial (the neutral element of the multiplication in Rix); 


— 0 the zero polynomial (the neutral element of the addition in Rix), 
— 0A the degree of polynomial A. 


We will assume that the concepts of polynomials division are known, as well as 
those of PGCD and PPCM of polynomials. If G and L are respectively the PGCD 
and the PPCM of A and B (A, B, Gand Le Rix), we will write: 


G=AABand L=AvB 


330 Analysis and Control of Linear Systems 


DEFINITION 11.1.— we say that several polynomials are prime among themselves 
when their PGCD is of 0 degree, i.e. when their only common divisors are non-zero 
constants. 


THEOREM 11.2 (BEZOUT THEOREM).— a necessary and sufficient condition for 
n A; polynomials to be prime among themselves is that there are n V; polynomials 
such that: 


Ÿ A7 =] [11.4] 


THEOREM 11.3 (BEZOUT EQUALITY).- since A and B are two polynomials 
prime among themselves, other than constants, there is only one pair of polynomials 
X and Y verifying: 


XA+YB=C with OX <0B and ƏY < ðA [11.5] 


THEOREM 11.4 (GENERALIZATION).- if A and B are two polynomials of PGCD 
G, then there is only one pair of polynomials X and Y such that: 


pese. JET [11.6] 
7 OY <d4-0G | 


THEOREM 11.5.— equation XA+YB =1 [11.7] has a solution if and only if the 
PGCD of A and B divides C. 


THEOREM 11.6.— let (X0, YO) be a particular solution of XA + YB = C [11.8] and 
let A; and B; be two polynomials prime among themselves such that A = AG 
[11.9] and B = B,G [11.10] where G = A ^ B ; thus the general solution is given 
by: 


X=X)-B,P 
| tis [11.11] 


Y=Y+AP 
where P is any polynomial of Rix]. 


Among all these solutions it is usual to seek a single solution which confirms a 
particular property. The most usual is the solution of minimum degree. 
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Let (Xo, Yo) be a particular solution of [11.3]; we know (Theorem 11.6) that the 
general solution is written: 


X=X)-B,P 
| ee [11.12] 


Y=} + AP 
with A = 4,G and B = B,G where G = A A B and P is any polynomial of Rix. 
By carrying out the Euclidean division of X, by B, we obtain: 
Xo=BU+V with 0V<oB, [11.13] 
by replacing in [11.12] we obtain: 
X= V-B,(P-U) [11.14] 


the solution for [11.3] with minimum degree in X will be obtained for P = U or: 


X=V 
[11.15] 
Y = Y)+A,U 


Indeed, based on [11.14]: 


OX < max{0V,0B,(P-U)} [11.16] 
If P + U, then: 
OB, (P - U) = oB, [11.17] 


and since by construction: 


ƏV < 0B, [11.18] 


max{0V,0B,(P-U)}>90B; [11.19] 
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the hypothesis P + U leads to a solution in X of a higher degree than that obtained 
for P =U. 
Q 


NOTE 11.1.— the solution of minimum degree for X does not generally coincide 
with the solution of minimum degree in Y. 


The preceding theorems make it thus possible to calculate the solution for [11.3]. 
Now that the resolution tools of polynomial equations are known, it is advisable to 
specify in relations [11.2] the degrees of freedom available to the designer and also 
to equally translate the constraints of synthesis related to the nature of the problem 
and the specifications of the correction. 


11.2. The obvious objectives of the correction 
11.2.1. Internal stability 


It is difficult to take a final decision at this stage since the representation of the 
correction given in Figure 11.1 is formal and does not represent the real 
implementation. However, it is clear [DOY 92] that the denominator of all the 
transfers being A,A,, these two polynomials must be stable (besides the 
simplification carried out by A, in [11.1] already supposed the stability of 4%); on the 
other hand, there should be no simplification of unstable root of A or B by the 
correctors built. On the other hand, the reverse is possible, i.e. we can choose some 
of the polynomials R, S and T in order to carry out such simplifications. 


Y BT 
Thus, based on the transfer in closed loop, — = ————— 
C AS+BR 


zeros and (stable) poles of the model of the process by using S or R. Let us note, 
following the example of [AST 90]: 


it is possible to hide 


A= A*A and B = B*B™ [11.20] 


where P'P represents the spectral factorization of the polynomial P, the roots of P* 
being all stable!, the roots of P` being all unstable. By supposing that: 


S = B*S' and R= AŸR' [11.21] 





1 Open left half-plane for the continuous systems, the open disc of unit radius for the discrete 
models. 
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we get: 


Y BT B'BT 
= = [11.22] 
C  AS+BR A'B'(A S'+B R') 





The choice of T = A*T' makes it possible to simplify by A*B*. We are in fact 
brought back to the preceding problem where R, S and T are replaced by R’, S and 
T’, and A, B by A, B. This is why subsequently, unless told otherwise, the 
simplifications will not be mentioned. 


11.2.2. Stationary behavior 


Since the internal stability is guaranteed, it is now possible to deal with the 
following stage, namely with the stationary behavior. The specifications of the 
correction outline the settings and interferences likely to stimulate the process. Let 
e(t) be the error signal (not explicit in the correction structure in Figure 11.1) 
neglecting the supposed noise of zero mean value: 


_AS+B(R-T),, BS 
AS + BR AS + BR 


E=C-Y 





[11.23] 


Generally, the authors [AST 90] then use [11.2 (b)] to simplify the expression of 
the contribution of the setting. In this case, the stationary behavior with respect to 
the order depends only on A,, and Bm the asymptotic follow-up of a step function 
setting resulting in the choice of a reference model of unit static gain. However, as it 
is noticed in [COR 96, WOL 93] this supposes a perfect identification of the 
procedure! In fact, the relations [11.2] are only true for the model of the procedure. 
Let A’ and B' be “the true” values of the denominator and numerator of the 
procedure; the real error obtained through the implementation of the RST corrector, 
calculated using model A, B, will in fact be: 


_AS+B(R-T), B'S 





E=C-Y [11.24] 
AS + B'R A'S + B'R 
and of course A'S + B'R + Am 4,. We suppose that: 
N 
C= and D = —4 [11.25] 








Di Dz Di Da 
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where N, and D, are polynomials prime among themselves, the indices + and — 
having the same significance as in [11.20]. Thus, for a continuous ramp setting we 


will have D, = s? and for a sinusoidal disturbance of angular frequency ,, 


Dj =s +2. 


By supposing that the calculated correction is sufficiently robust so that 
A'S + B'R has all its roots stable, the stationary error will be cancelled only if D, 
divides AS + B(R-T) and Dj divides BS. As seen above, the values of A and B 
are not exact and thus it is R, S and T that will provide this function2. The stationary 
specifications thus lead to imposing the following constraints (without taking into 
account possible integrations of the process): 


S=D,S' S = DisS] 
R-T=D,L or R-T=D,L [11.26] 
S = Da S" Dy = D; v Dg 


The preceding section s made it possible to set a certain number of constraints on 
the unknown factors of the problem and provided a general context for its solving. 
The following section will provide a calculation tool for polynomials R, S and T. 


11.2.3. General formulation 


We must solve [11.2] with the conditions [11.26], or: 


GED 
BE = Pint with |R—T = DIL [11.27] 
AS + BR = AA, c 

DR = D; v D3 





2 When the process is integrator we can write A = sA’ despite identification errors. 
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Since BT = B,4,, B must divide the B,,4, product. We saw above (section 
11.1.3) that polynomial A, must be stable and thus it can share with B only stable 
roots. Let B, be the part of factorized B in A3. Consequently, polynomial B,, must 
“become in charge” with the non-factorized part of B in A,. Hence, let us assume 
that: 


B=BB 4, = BA! Bm = BoBm' [11.28] 


Therefore, B will have to contain at least all the unstable roots of B. Taking into 
account these factorizations, we obtain: 


T = Byy'Ag [11.29] 


On the other hand, according to [11.2(b)], since B/ divides A, and B it also 
divides AS. However, A and B are prime between themselves by hypothesis and 
therefore Bı divides S and S, (since B; is stable and not Dj, ). Finally, we can write: 


(a) B=B,B, B; stable 

(b) Dae = De Y Da 

(c) ADS + BR = An4o' 

(d) DIL+Bn'A'=R [11.30] 


(©) S=DxB,S 
(f) T=By'Ay' 
(g) Bm = B2Bm' 


All these relations express the respect of internal stability (by supposing of 
course Am and A,’ stable) and desired stationary performances. We notice that these 
relations require the choice of polynomials (4,,, 4o ) and the factorization of B and 
then the solving of two Diophantus equations [11.30(c)] and [11.30(d)]. The 
following section is dedicated to the complete resolution of [11.30]. In particular it 
will be pointed out which are the degrees of freedom available to the designer in the 
choices mentioned above. 





3 We will have a maximum of B; = B* according to the notations in section 3.1.3, equation 
[3.20]. 
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11.3. Resolution 


As previously seen, it is possible to develop a general solution (Theorem 11.6) 
by formal calculation. However, it is more usual to solve the Diophantus equations 
resulting from this approach by using linear algebra. This approach makes it 


possible to set the degrees of freedom of the designer. Indeed, if we write*: 


ðA 
A= Ya;s' 

i=0 

oX | 
X= xs 
i=0 


oB ; 
B= bjs’ C 
i=0 
oY > 
Y= Lys 
i=0 


= est 


[11.31] 


The resolution of equation [11.3] goes back to that of the following system: 


[ap 0 0 
ay ap 0 
>o ap a 

-E 

Gon 
0 a, 

0 aay 
0 





o 0 
b, b 
b, 

Ds 
0 ba 
0 




















0 Ce Cy 
0 o` Xi C} 
bo 
b, 
xx E 
| Yo 
bas 
0 
ILYav J [eac | 


[11.32] 


Each row is obtained by equalizing the terms having the same power in [11.3]. 
This system is called the Sylvester system. The resolution of this system of 
equations requires knowing the degrees of the various polynomials and that part has 
not yet been set. It must be noted that in our problem, Æ and B are prime between 
themselves and, consequently, according to Theorem 11.5 [11.32] has one solution. 


4 For discrete systems the variable would be “7” 


and not “s”. 
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Before solving the general case, we will deal with a particular case (the one that 
is the most frequently dealt with in other works), which will enable us to show the 
approach used. 


11.3.1 Resolution of a particular case 


We find ourselves here in the case when no specification is made on the setting 
and the interference. Thus, only relations [11.2] should be solved. We know that 
polynomials Am and A, must be stable, but this information is not sufficient for the 
designer and we must know the degrees of these polynomials to write the Sylvester 
system. Can we choose these degrees randomly? The following section makes it 
possible to answer this question. 


11.3.1.1. Conditions on the degrees 


We suppose [DOY 92] that the model of the process is strictly proper, whereas 
the correctors will be supposed simply proper. Consequently: 


(a) d4 > dB 
(b) 2S2>0R [11.33] 
(c) OS >OT 


From relations [11.33(a) and (b)] and [11.2(b)], we obtain: 


OB + OR < JA +0S = 04, +0Am [11.34] 


The uniqueness of the solution will thus be obtained by simply imposing: 
number of equations = number of unknown factors [11.35] 


The unknown factors in [11.2(b)] are the coefficients of the polynomials S and R 
and thus?: 


number of unknown factors = 0S + 0R + 2 [11.36a] 


5 A polynomial of degree n has n + 1 coefficients. 
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The number of equations is the number of rows in the Sylvester system and thus: 


number of equations = 04,, + 0A, +1 [11.36b] 


Considering the uniqueness of the solution and taking into account [11.34], we 
obtain: 


0A, +04, +1=0S +0R +72 [11.37] 

OR = d4-1 [11.38] 
while of course always using [11.34]: 

OS = 0A, +04, — 04 [11.39] 


Until this stage, polynomials A,, and A, do not have any constraint except for 
stability. The conditions for the regulators to be proper will introduce the following 
constraints: 


dS>0R => 04, +04, -04>0A-1 [11.40] 


from where we obtain the first inequality referring to the degrees of Ap and A ;: 


OA, + 0A, 2 204-1 [11.41] 
; T. : 
by using [11.2(a)] and the fact that x is proper, we obtain: 


ƏS2ƏT => dS+0B > OT +0B = dBn +04, [11.42] 


and finally by using [11.39] we obtain a second inequality: 


0A +04, -JA +0B > OB, +04, [11.43] 
— 
as 
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which rearranged gives: 


04m - dB» > 04 -0B [11.44] 


This simply means that the correction can only increase the relative degree. 


11.3.1.2. Standard solution 


We must first of all choose B,,. By using the factorization B = B,B;, where B} 
is stable (we can choose B, = 1 if we want to have a completely free choice of A.) 
and consequently: 


A, = B,A,' and B = ByBm' [11.45] 


In order to minimize the complexity of the elements of the corrector, we usually 
choose a polynomial B,,’ = à the constant @ being chosen in order to ensure a unit 
static gain for the model of reference®. Relations [11.41] and [11.44] thus become 
(OB y,'=0): 


04, — (ƏB, +OBy,') = JA - (ƏB, + dB) 
—— —_— —— 
B,, OB 
=> Am > 04 - dB, [11.46] 
d4,, + 04,'+0B, > 204-1 
=> Am + 04,'> 29A -1- 0B, 


We have seen in [11.30] that polynomial S is thus divisible by B,. Figure 11.2 
gives a graphic representation of the conditions [11.41] and [11.46]. 





6 It is pointed out that this choice is only a necessary condition to the asymptotic follow-up of 
a step function setting (see section 11.2.2). 
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/ 


OA’, 






BA -B — PAm 
Figure 11.2. Choice of the degrees of A, and A,’ 
11.3.1.3. Example 


Let us take an academic example. Let the process be described by the transfer: 


B s+1 
A (52 +0,15+1) 





Therefore, we have the choice to make B; = 1 or s+1 and thus B;, = s + 1 or 1 
respectively. 


B,=stlandB,=1 
The inequalities [11.46] thus lead to the relations: 


dm 23-1=2 
dm +04,'2 2*3-1-1=4 


If we choose the polynomials of minimum degree, we obtain: 


dm = 2 
dA,'= 2 
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We can thus choose’: 


This choice of B,, ensures a unit static gain to the reference model. We can thus 
use the resolution of the Bezout equation: 


AS + BR = Ay, A, 


The resolution of this system of equations in this particular case gives: 


S = (s +1)(s + 7.9) 

R = 22.215? +24.1s+16 
T = B,,A,'= 4(s +2)? 
Bat 


The inequalities [11.46] lead then to the relations: 


An = 3 
OAm +94, '2 2*3-1=5 
here A,'= 4, 


If we choose the polynomials of minimum degree, we have: 


Am =3 
3A '=2 





7 The choice of the roots of these polynomials will be seen later. For the moment the only 
constraint is to choose them stable. 
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We can thus take: 


Am = (s +2) 
4, = (s +2)? 
Bm = 8(s +1) 


By using the same procedure as before we obtain: 


S=s? +7s+7 
R = 205? + 41s +32 
T = Bm'Ao = 8(s + 2)? 


11.3.2. General case 


11.3.2.1. Choice of degrees 


This section is dedicated to the resolution of relations [11.30]. The methodology 
is the same as the one used in the preceding paragraph, but here the conditions of 
being proper do not relate directly to the unknown polynomials. Relations [11.33], 
[11.34] and [11.39] are always valid since we must solve: 





A(Dje B15) + (B1B3 )R = Amn(B140') [11.47] 
— 
S B A 


o 


By taking into account [11.39] and the factorization of S, the degree of S is 
given by: 


OS = 0Am + 0A, — 04 -OD Gy [11.48] 


The uniqueness of the solution will be ensured if the number of equations is 
equal to the number of unknown factors, or: 


0A + 0S +0D, +1 = oS + OR +2 
UY 
number of equations number of unknown factors [11.49] 


=> 0R=0A+0D,, -1 
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To solve [11.30(c)], it is necessary to know the degree of S and thus that of B w» 
which itself is solution of [11.30(d)]. For this equation, there is no constraint on 
being proper and we will set the uniqueness of the solution by retaining that of 
minimum degree in B,,’. The idea is to minimize the complexity of the transfers to 
be done. We consequently obtain: 


OB,,'= 0D; -1 (0D; = 1) [11.50] 


We must now represent the property of the corrector by using [11.48] and 
[11.49]. The inequality 0S > OR gives: 


OAm +04, > 204 +0D% -1 [11.51] 
also, the inequality 0S = OT leads to the condition: 
0A, 2 04+0D,, - OB, -1 [11.52] 
Figure 11.3 represents these inequalities geometrically. 


/°# dA, + OA’, > 23A + Da — 6B1- 1 


OAS À 








D 
BA + ED: — ôB -1 —” OAm 


Figure 11.3. Choice of degrees of Am and A,’ 
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11.3.2.2. Example 


Let us take the following example: 


A=s(s° +0.1s +1) 


B=s+0.5 

D, =s : K 
a => Dy =8 
D, =S 


the inequalities [11.51] and [11.52], by choosing B, = 1, give: 


Am + Am > 2044+ 0Dz, -1=2*3+2-1=7 
Am > 0A+0Dz -1=3+2-1=4 


by using the minimal degrees and by choosing identical dynamics for A,, and A, we 
can take: 


An =(s+1)4 
A, =(s +1) 


We obtain in this particular case: 


R =16.1621s* +19.7042s° +2257 + 10s +2 
S = 5° (s? +6.9s +3.1479) 
T = 4s +145 +1857 +10s +2 


B, =4s° +4541 


11.4. Implementation 


In section 11.1.1 it was mentioned that the structure in Figure 11.1 is formal 
because the represented transfers are not proper. This section makes it possible to 
carry out the control law. 
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11.4.1. First possibility 
Su = Tc — Ry [11.53] 


with physically feasible operators. A possibility [IRV 91] consists of introducing a 
stable auxiliary polynomial F of an equal degree to that of S into relation [11.53], 
which becomes: 


S T R 
~y=—c-—y [11.54] 
F F F 


the corrector is thus carried out as indicated in Figure 11.4. 


Tl aati 
F| NY S5 











Figure 11.4. Realization of the corrector 


This realization is not minimal because it leads to the construction of three 
transfers of OS order. The following section provides a minimal representation of the 
RST regulator [CHE 87]. 


11.4.2. Minimal representation 


If we return to relation [11.53], we can obviously write: 


u=—c-—y [11.55] 


The realization of the first term leads in the majority of cases to the achievement 
of an unstable transfer (S always has a zero root). It is rather necessary to regard the 
corrector as a system having two inputs c() and y(t) and one output u(t) and thus it 
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is enough to write an equation of state verified by this system. The following 
example illustrates the procedure. 


11.4.2.1. Example 


For A =s? +s+1;:B=2;4, = Ay =(9+1)?; D7 = D4 =s, by using the 
previously described procedure, we obtain: 


R=s° +0.55+0.5 
S=5' +35 


T =0.5s° +5 +0.5 


Hence, the control verifies: 
(5? +3s)U = (0.557 + s + 0.5)C — (s? + 0.55 + 0.5)Y 


By leaving on the left the term in U of highest degree and by dividing each 
member by s’, we obtain: 


1 1 
U =0.5C -0.5Y +—{-3U + C -0.5Y + —[0.5C -0.5Y] 
s s 
and thus by supposing that®: 


1 
X =—[0.5C -0.5Y] 
1 s 


1 
X, =—[-3U +C -0.5Y + X1] 
S 


8 Signals x, and x, here do not have any relationship with those in Figure 11.1. 
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in the time field we have: 


x, = 0.5e-0.5y 
X, = —3u+c—-05y+x, 


u=0Sc-y+x, 
finally, by replacing in x, , u by its expression according to c, y and xz: 


xX, = 0.5c-0.5y 
X, =x, — 3x, —-0.5c+25y 


u=0Sc-y+x, 


and in the matrix, we have: 


a à si ep c=lo 1] p=[o5 -1] 


1 -3 —0.5 2.5 


11.4.2.2. Generalization 


If we write’: 


n . 
S = Yas! 
i=0 


n 
R=) ps! with o, =1 and n=max(9S,9R,ƏT) = dS [11.56] 
i=0 


n . 
T = D T;s' 
i=0 





9 The continuous case is used here, but the approach is completely identical to the discrete 
case: it is enough to replace s by z in what follows. 
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this procedure can be generalized. Using [11.52] and [11.56], we obtain: 


n-1 
s"(U-T,C+ Pn) = y's (=0;u + T;c- 0;y) 
i=0 


either by dividing by s”: 


-1 
(U—T,C+0,Y)=S {-0,_ju + TC — Op) + 


S {[(-Op gu + Ty20- Py py) + FS (00u + TC MOY} } 
or by supposing that: 


X = s7! (ogu + Toc — y) 
x2 =s {Cout qe- Py) +X} 
-1 [11.57] 
Xn1=S {COn-24 + Tp-2€ = Pn_2Y) + Xn2} 
Xn = s | {(-On1u + Tn-1C— Pn_1Y) + Xn-1} 


=U+tT,C— Pyy 


It is easy to see that the last equation makes it possible to express u according to 
c, y and x,. That represents of course the output equation and thus: 


[11.58] 


Robust Single-Variable Control through Pole Placement 349 


By replacing u by x, — 7,c + Ony in the expressions of x, X2,....Xn we obtain 
matrices À and B of the equation of state: 








0 0 0 -0 
1 0 0 — 0] 
0 1 * 0 = 
A=|0 0 ` : : 
; z . 0 —On-2 
11.59 
[0 0 . . 0 1 -a,4] Le) 
oT + To = F0Pn — Po 
Oifn + 7 — O1Pn — Pi 
B= : : 
Lon 17n tTn- = On-10n = Pn- | 








11.4.3. Management of saturations 


11.4.3.1. Method 


Many failures that occurred when the so-called “advanced” techniques were 
applied could have been avoided if the implemented regulators had managed the 
inherent saturations of every industrial procedure. The RST control is no exception. 
The previously discussed example can be used to emphasize the problem and its 
solution. We will be dealing with the structure described in Figure 11.5. 

















Figure 11.5. Corrector with saturation 
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The following answers show the difference in operation with and without 
saturation. 








no saturation 
output y(t) is confused with that of reference model B „/Am 











saturation at #0.5 
(a) with saturation, (b) reference model 


Figure 11.6. Comparison of behavior with and without saturation 
(note that the scales in ordinate are not the same!); in both cases F(s) = (s + 1)* 
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The use of the strategy [AST 90, LAR 93] described in the following figure 
makes it possible to considerably decrease the effect of saturation as can be seen on 
the simulation in Figure 11.8. 





Usat 











= Ll kane ins 


enna 





0 123 4 5 6 7 8 8 10 
Time (s) 


Figure 11.8. Behavior in the presence of saturation and with “anti-wind up” with 
the response of the reference model represented by the dotted lines 


This structure can of course be implemented with the help of the realization 
presented in section 11.4.2.2. 
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11.4.3.2. Justification of the method [KAI 80] 


B ; f 
The transfer process P can be described by an equation of state: 


x = Ax + Bu 
pace [11.60] 


where A, B and C are matrices of adequate size and x the state vector. The control 
of this process can thus be done by the technique of pole placement using an 
observer [KWA 72] according to the diagram in Figure 11.9. 


u 





PIW 








Observer 


Figure 11.9. State feedback control with observer 


It is known [KWA 72, LAR 93] that the complete system thus corrected has as 
its poles the eigenvalues of the A-BK matrix (placement of poles) and the poles of 
the observer (principle of separation). If we do not take into account the 
deterministic interferences and the characteristics of the instructions, we have, at 
least, 2n-1 poles to place of which n-1 come from the minimal order observer. We 
thus find the significance of polynomials Am and A, (from where we get the name of 
polynomial of the observer for 4,) 


Let us say that: 


v = Ki [11.61] 


Signals u, y and v are scalar; according to what was previously said, we can 
thus define transfers such as: 





peers [11.62] 
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and thus, based on Figure 11.9, we have: 








u=c-v 
N N BA 
=c- y yl > y= o c [11.63] 
A, À A(A,+N,) + NyB 
A 
u=—y 
B 


By approaching the expression [11.1], we note that the state feedback control 
with observer corresponds to an RST structure where we chose: 


T =A, [11.64] 
polynomials R and S thus being: 


S=A,+N, 


11. 
R=N, [11.65] 


In the presence of saturation, there is the control described in Figure 11.11. 

















Figure 11.10. State return control with observer in the presence of control saturation 
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It is easy to make a connection between this diagram and the one in Figure 11.5. 
The interest in this structure is thus understood since signal v is calculated 
according to the “true” control of the process and its output. 


11.5. Methodology 


We have so far defined the various degrees of freedom of the RST control and 
now, to return to the objectives of section 11.1.1, we still have to define a “good 
transient state” by taking into account the level of noise on the control and the 
robustness aspect with respect to modeling errors. 


11.5.1. Intuitive approach 


Many mishaps can be avoided if we keep in mind the following rule: the more 
important the required performances are, the more disturbed the control will be and 
the more sensitive the corrected system will become to modeling errors. 


The following examples illustrate this “saying” within the context of this 
chapter. The model of the process considered is defined by: 


B 2 


A 5(s7 +0.1s +1) 


the synthesis being done for a step function setting and interference. Taking into 
account what has been done above, polynomials 4, and 4, must be both of a degree 
at least equal to 3. The table below shows the results obtained for various 
polynomials Ám and A, the system being subjected to a step function setting and 
interference at 1 and 15 seconds respectively. 
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Without noise With noise 























0 10 20 30 40 50 7 0 10 20 30 40 50 


Time (s) Time (s) 
Aç=(s+1) An=(s+1)3 























0 10 20 30 40 50 
Time (s) 
A, =(s+2)3 

















Ah 
me sul 








“0 10 20 30 40 50 
Time (s) Time (s) 
A, =(s + 133 Am = (s + 2)3 


Figure 11.11. /nfluence of the measurement noise according 
to the dynamics chosen for À, and Am 
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The column on the left shows the results in the absence of measurement noise. 
We observe that the more the poles of A,, are on the left in plane s, the faster the 
response to the setting is and the better rejected the interference is when the poles of 
A, are on the left in plane s. On the other hand, we note that the increase in 
dynamics by both A,, and A, leads to an amplification of the noise on the control. It 
will never be pointed out enough that the designer should not be satisfied to look at 
the output of the process during the simulation! 


The following example illustrates the finally accepted phenomenon, that the 
increase of the required performances is confronted with robustness with respect to 
modeling errors (the precision-stability dilemma was a beginning). The synthesis is 
always made following the model of the preceding section, the “real” process having 
a time constant of 0.5 s, which is neglected in the model. The results of simulation 
do not require comments. 











Real system 


























0 10 20 30 40 50 ~o 10 20 30 4 50 


Ti : 
ime (s) D An =(8 +2) Time (s) 


Figure 11.12. /nfluence of a modeling error according to the choice of Am 


11.5.2. Reduction of the noise on the control by choice of degrees 


From Figure 11.1, it appears that: 





n [11.66] 


So if we choose the minimum degrees for 4,, and A, according to relations 
[11.49] and [11.51], we observe that the transfer beeen the control and the 
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measurement noise is simply proper. This means that all the high frequencies of n 
will pass in u . Always according to the same relations, all we have to do in order to 
obtain a strictly proper transfer between and n is to choose a polynomial A, of 
degree higher than the minimal value. The following curves properly illustrate this 
phenomenon in the scalar field. 


AR 
Am Ao 
40 








dB 








: Am = (8 + 2)3 


204 


04 























-60 AL LL LL LL -80 T TT T ToT T Toy T Tomy TITI] 
1072 107 100 101 102 103 10-2 10-1 100 101 102 103 
@ (rd/s) @ (rd/s) 
Aj =(s+ 133 Ao = (s+ 1)4 


Figure 11.13. Influence of the choice of the degree of A, on u/n 


11.5.3. Choice of the dynamics of An and A, 


Section 11.5.1 shows the influence of the dynamics of these polynomials on the 
performances of corrected system. Section 11.5.2 provides the starting point of the 
solution by offering an additional choice criterion on their degree. We will discuss 
here the choice of their roots. This section is based on [LAR 99] where the author 
develops a methodology based on the LQG-LTR control. 


In all that follows, we will suppose that the setting and the interference are step 
functions and that polynomial T is chosen equal to A, i.e.: 


Dz = Dj = Dx =s [11.67] 
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in this case, conditions [11.30] on the degrees become: 


OA, = 0A 
0A, = 0A+1 [11.68] 
OR =0A 
OS =0A+1 with S=sS' 
S’ being the solution of: 
AsS'+BR = Ay Ao [11.69] 


11.5.3.1. Optimal choice of A,, 


In the absence of interference, according to [11.23], the tracking error is given 
by: 





BT B 
E=C-Y=C- C=Cl1- [11.70] 
Am 4o A 


m m 


A possible synthesis strategy can be the choice of A, in order to minimize the H, 
standard of this transfer. A direct application of the variational calculation leads to 
the following stable optimal solution: 


AP = B*(s)B (=s) [11.71] 


This means that the roots of Avr are the stable roots of B and the unstable roots 
of B made symmetrical with respect to the imaginary axis. Of course this solution 
does not satisfy the constraint on the degrees. For that we will have to add a certain 
number of roots, such as: 


AP = B*(s)B_(-s)(1+ T,s)94798 [11.72] 
If the model has minimum phase difference (B = 1), we can show that: 


lim S'= B [11.73] 
T.—0 
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indeed, S’ is the solution of: 
AsS'+BR = B(L+T,s)4-® 4, [11.74] 
and thus S’ is divisible by B. After reduction by B, we obtain: 


R = (1+ Tys)4-* A, — Ass" 
S'= BS" 


[11.75] 


when T, tends toward zero, we see that S’’ must tend toward a constant so that 
OR remains equal to OR . Thus, by considering the highest degree terms of A and A, 
as equal, $” will tend toward 1 when T, tends toward zero. 


11.5.3.2. Optimal choice of A, 


We have seen above that the transfer connecting the measurement noise to the 
control is given by: 





U AR 
= [11.76] 
N Ams 
for instance, in the limit case mentioned previously: 
U AR A(A, — SA 
= Po) [11.77] 





N Amâ BAT 4 


In order to reduce the influence of the noise on the control! we can choose A, in 


ee sA A : 
order to minimize the H, standard of 1 — ——. The use of the variational calculation 
(0 
leads as before to a polynomial A, whose roots are the stable roots of s4 and also 
symmetric with respect to the imaginary axis of the unstable roots. However, here 
there is the difficulty of the root at the origin. So we define the under optimal 
solution: 


10 This procedure supposes seeking an observer which could minimize the restoration noise 
of Kx (see section 11.4.3.2). 
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A, = Porc + > [11.78] 


o 


this solution tending towards the optimal solution for T, tending towards infinity! 1. 


11.5.3.3. Study of the sensitivity function 
Let L(s) be the transfer of the loop; we can define the sensitivity function È by: 


1 AS AS 


>= = —q— = 
1+L  AS+BR A,4, 





[11.79] 


in the case of a system without unstable zero, the choice of an optimal polynomial 
An leads to the sensitivity function: 


À ' 
ye 4S [11.80] 


A BETA 


and thus for the choice of A, seen above and when T, tends toward zero we have: 


SA 
À 


ne 
= A OA Je [11.81] 


1 
| + aa (s)A-(-s)| |S * T, 





lim [2] = 
T,>0 ő 








o 


this for a process without integration ( 4(0) # 0). 


1 
Thus, everything occurs as if the loop transfer was —— and we obtain a phase 
sTo 


1 PERRE dir | 
margin of ay, to the angular frequency ©, = Ge , which gives a limit delay margin 
o 


o: 


1 
of M, =—T, 
2 


11 The presence of integrators in the process will be treated in a similar way. 
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In the case of a model with one integration, the limit sensitivity function will be: 


lim [2] = <1 [11.82] 
T.—0 1 2 
s +— 
To 


the equivalent loop transfer thus becomes: 








L= 1+27,5 
T 





[11.83] 


and the lag margin being M, = 0.647 T, . 


It should be noted that the sensitivity function is always of a module less than 1 
(except for the infinite frequency). 


11.5.3.4. Some improvements 


If the poles of A (or their symmetric) are slightly damped or too slow, the choice 
of A, suggested above may lead to bad dynamics of regulation. This is why it is 
advisable to project these undesirable poles on the abscise vertical —1/ To. 


Pt Pi 


Transformed Initial 
pole pole 


10 





Figure 11.14. Choice of the roots of A, 
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Thus, the contribution of this pair to the sensitivity function is: 


(s — pi) 


<1 [11.84] 
(s- Pt) 


s=i@ 








and therefore this transformation does not affect the main result of these choices: the 
limit sensitivity function is always of a module less than 1. 


Similarly, the presence of strong dynamic zeros leads, through the procedure of 
optimal choice of Am, to imposing fast methods to the corrected process. Several 
methods are proposed in order to avoid this obstacle [LAR 99]. We will suggest here 
the procedure illustrated in Figure 11.15. 
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Figure 11.15. Choice of the roots of Am 


This procedure is of course to be prohibited if the transformed pole proves to be 
not sufficiently damped. 


NOTE 11.2— we note that if we choose to make A, tend toward sA (always under 
the hypothesis step function setting and interference), the R solution of the Bezout 
equation will tend toward A, and the solution S’ toward Am. Consequently, the 
AS SA s 

A 


sensitivity function will still have the same limit 





1 
o gp 
T, 


o*m 


We can thus reach the same limit sensitivity function either by making Am. tend 
toward B (T:—0) or by making A, tend toward sA (T—ce). 
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11.5.3.5. Method of adjustment 


Based on the particular choices for Am and 4,;: 


1 
Ay = AS + — 
To [11.85] 


Am SRST) 


the optimal solution corresponds to T, — œ and T, — 0, or in practice when: 


T, 
EC [11.86] 


Practically, we obtain a sensitivity close to the limit sensitivity for a 5 order 
ratio. 


Increasing 7, increases the lag margin at the expense of the dynamics of 
regulation. 


Decreasing T, makes the corrected system more sensitive to measurement noises 
(see the example in section 11.5.2). 


The initial value of T, can be either: 
— the desired lag margin; 
— the dominant time constant in desired regulation; 


— a value close to the dominant time constant in open loop (for a stable process 
of course). 


11.5.4. Examples 


In section 11.5.1 we presented an example in order to highlight the influence of 
the dynamics of polynomials 4,, and A, on the correction. We will reiterate this 
example by applying the methodology developed above. As the model of this 
example presents a slightly damped oscillating dynamics, we will be able to clarify 
the “improvements” described in section 11.5.3.3. The model is thus described by 
the transfer: 
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CEE [11.87] 
A s(s° +0.1s +1) 


the setting and the interference being always steps functions. In accordance with the 
suggested methodology, we will have: 


Am = 2(T.s +1) [11.88] 


and: 


2 
A, =(s" ots+ifs+t] [11.89] 


o 


if we “hide” the oscillating poles, or: 


1 4 
= £ = [11.90] 


in the opposite case. The following figures show the results obtained in the scalar 
field. Figure 11.16 corresponds to choice [11.89], whereas Figure 11.17 corresponds 


: : : ne . SA 
to choice [11.90]. We notice that in each case, the target sensitivity function — 
A 


o 
' 


and the real sensitivity function 





have their modules close, thus justifying the 
om 

approximation carried out in the design. In addition, in choice [11.89], we notice 

that the target sensitivity function is of a module less than 1. 
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The following figures give an outline of the time performances obtained in each 
one of these cases for various choices of T,. Each figure represents the behavior of 
the corrected process following a rule: 


— without measurement noise and model identical to the process; 
— with measurement noise and model identical to the process; 


— without measurement noise and process with a time constant of 0.5 s neglected 
in the model. 
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T.=04;T,=2 
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From these answers we notice that the lower T., the faster the feedback control 
response, but on the other side the control is subject to more noise. On the other 
hand, the higher 7,, the more robust the system will be to modeling errors, with a 
rejection dynamics of less competitive interference (we will have noticed in the last 
simulation the appearance of instability). Finally, in [11.89], the fact of hiding the 
oscillating poles of the process leads to a rejection of oscillating interference. 
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Figure 11.18. Time responses for T, = 5 seconds 
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Due to the degree of robustness when facing modeling errors, of the level of 
measurement noise, the designer can “easily” adjust the only two adjustment 
parameters Tand T.. 


11.6. Conclusion 


Many problems of automatic control can be reduced to single-variable 
corrections. In this context, the placement of poles performed by the RST structure, 
i.e. with a corrector with two degrees of freedom, has been developed these last few 
years. This chapter aimed at providing the designer with the maximum of degrees of 
freedom to carry out this synthesis. 


Initially, the general solution of the RST control was given. In particular, the 
choice of synthesis polynomials degrees was discussed according to the classes of 
settings and interferences. Then the aspects of implementation were developed and, 
in particular, the problem of saturation of the actuator was largely dealt with. 
Finally, a methodology of choice of feedback control and regulation dynamics 
(choice of polynomials Am and 4o) was provided. This approach makes it possible to 
set performances compatible with a level of robustness chosen in advance. This 
simple approach uses two adjustment parameters (two time constants) which makes 
it accessible to non-specialists. 
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Chapter 12 


Predictive Control 


The developments presented in this chapter aim to cover the main ideas of 
predictive control and then to indicate the details of the analytical minimization of 
the criterion for two individual structures enabling the elaboration of the equivalent 
polynomial regulator. The choice of adjustment parameters will also be analyzed, 
providing some simple rules that guarantee the corrected system good stability and 
robustness. 


12.1. General principles of predictive control 


Predictive control is based on some relatively old and intuitive ideas [RIC 78], 
but it has been developed as an advanced control technique mainly since the 1980s. 
This development was done mainly according to two privileged main lines: 


— generalized predictive control (GPC) by Clarke (1985); 
— functional predictive control (FPC) by Richalet (1987). 


The philosophy of predictive control lies on the definition of five great ideas, 
common to all the methods. 
12.1.1. Anticipative aspect 


This anticipative effect is obtained by using explicit knowledge on the evolution 
of the trajectory to be followed in the future (necessary knowledge required at least 
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on the horizon of some points beyond the present moment). This constraint which 
makes it possible to make good use of all the resources of the method, necessarily 
restricts the application field to the control of the systems for which the trajectory to 
follow is perfectly known and stored pixel by pixel in the computer. It is the case of 
the numerical control of machine-tools (cutting the pieces), of the control of robots 
arms, of monitoring the temperature profile of the applications in home automation, 
etc. 


12.1.2. Explicit prediction of future behavior 


The method requires the definition of a numerical model of the system, which 
makes it possible to predict the future behavior of the system. This discrete model 
results mainly from a preliminary offline identification. This feature makes it 
possible to classify predictive control in the big family of Model Based Control 
(MBC). 


12.1.3. Optimization by minimization of a quadratic criterion 


The optimization which makes it possible to obtain the control law is done by 
minimizing a quadratic criterion with finite horizon referring to the errors of future 
prediction, the variance between the predicted output of the system and the future 
setting or the reference trajectory inferred from this setting. 


12.1.4. Principle of the sliding horizon 
The elaboration of a sequence of future controls results from the preceding 
minimization, which is optimal in what the quadratic criterion is concerned, out of 


which only the first value is applied to the system and the model. 


The preceding steps are then repeated during the following sampling period 
according to the principle of sliding horizon, as seen in Figure 12.1. 
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Future controls 





Future 


Figure 12.1. Principle of sliding horizon 


Predicted|output y 


The objective of the polynomial predictive regulator obtained by minimizing the 
criterion is that the predicted output joins the setting or the reference trajectory on a 
given prediction horizon. The principles that we have just mentioned make it 
possible to establish the operation diagram in Figure 12.2. 
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Figure 12.2. Operation principle of a predictive algorithm 


Hence, the principle of the sliding horizon means that only the control at the 
present moment u(f) is applied on the system. Therefore, it is possible to limit the 
number of estimated values of the sequence. 
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12.2. Generalized predictive control (GPC) 
12.2.1. Formulation of the control law 


The objective of this section is to indicate the fundamental points of the 
predictive structure considered [CLA 87a, CLA 87b], in the monovariable case, 
from the mathematical translation of the preceding general concepts up to the 
obtaining the equivalent polynomial regulator. 


12.2.1.1. Definition of the numerical model 


All the forms are allowable for the model but the input/output polynomial 
approach by transfer functions is preferred. 


Traditionally the model is represented as CARIMA (Controlled AutoRegressive 
Integrated Moving Average): 
= t 
AGO = BG uD + [12.1] 
Alq 


where A(g"!) =1- q! , u(t) and y(^ are the input and output of the model, é(t) 
is a centered white noise, g is the delay operator and Aq’) and B(q |) are 
polynomials defined by: 


-1 -1 
Alq )=l+a;g +: Hd, q a Do 
P l [12.2 
Bq )=botbiqg toce +b», q 


This model, which is also called incremental model, introduces an integral action 
and makes it possible to undo all the static errors with respect to the input or step 
function interference. 


12.2.1.2. Optimal predictor 


The predicted output y(t+ j/t) is traditionally decomposed into a free and 
forced response [FAV 88], including a polynomial form meant to properly conclude 
the final polynomial synthesis: 


V+ JID =F T )yO+ Hj (q ')au(t-)+G,(q )Aue+ j-D+I DEC) [12.3] 


free response forced response 
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The unknown polynomials F;, Gj, H;, J; are single solutions of Diophantus 
equations, which are obtained by equality of the inputs and output of transfer 
functions of equations [12.1] and [12.3] and they are solved recursively: 





Aq Aq )J(q')+q FE 





GD +q H= BADIA ei 
ne sees ie ')|= j-1 tele (a = degree Lag” 1] 
wit keele k = j-1 degree [H ie = degree [el oey 


The set of calculations may be done in real-time off loop. The optimal predictor 
is finally defined by considering that the best noise prediction in the future is its 
mean (here supposed as zero), let us suppose that: 


Het JID =F (q y+ Hq ')Aut-1) +G; (q Aut+j-) [12.5] 
12.2.1.3. Definition and minimization of the quadratic criterion 


The control law is obtained by minimizing a quadratic criterion pertaining to 
future errors with a weighting term on the control: 


N> N, 
J= $ DE+)-wE+ j? +49 A + j- [12.6] 
j=N; jel 


with: Au(t+ j)=0 for j2 N,- 


The criterion requires the definition of four adjustment parameters: 
— N; : minimal prediction horizon; 

—N>: maximal prediction horizon; 

— N, : prediction horizon on the control; 


— À : weighting coefficient on the control. 
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12.2.1.4. Synthesis of the equivalent polynomial RST regulator 


The minimization of the criterion is based on writing the prediction equation 
[12.5] and the cost function [12.6] in a matrix form, such as: 


$ = Ga tif y(r)+ih Au(t—1) 


J =(Gi+ if y) + ih Aue -1)- w]! 


[12.7] 
[Gü+if y) +ih Aut-D-wl+Aù7 ü 


it=[Fy gt) Fy, bl 


with: ih = [Hy a» ve eae) 


a=[Au(t) + Au(t+N, -D 
Sts n - 
EN, ÊN- 
Ni+1  Ni#l 
G=|ÊM+ ÊN, 











Ny N, eae No 
| EN, ÊN,- EN, -N, +1 | 


The analytical minimization of the criterion leads to an optimal sequence of 
future controls: 


dope = N[w-if y()—ih Aut -1)] [12.8] 
with: N=[G™G+Aly,]1G™=[nf + nh T 

o T 

Boy = AUDop = Au(t+ Ny = Dope | 


w=[|w@+N,) = w(t+ Nə)" 
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Traditionally, in a predictive control, only the first value of the sequence, 
equation [12.8] is applied to the system, according to the principle of the sliding 
horizon: 


U opt (Ð) = Yop: (nF lif yO) +ih Aug, (¢-D—w] [12.9] 


Based on the above relation, it is finally possible to obtain the polynomial 
representation of the equivalent regulator as indicated in Figure 12.3. This traditional 
RST structure enables the implementation of the control law by a simple 
difference equation: 


S(T DAUA) = -RDV + TWA) [12.10] 


The three polynomials have the following form: 


Sq’) =(1 +n} ih q7’) degree [sa] = degree ER] 
R(q |) = ni if degree [Ra] = degree [aa] 
Tq) =n] |! e q™ | T degree [T(q)] = N3 





"TENA CAN | 2 











Equivalent polynominal regulator 


Figure 12.3. Structure of the equivalent polynomial regulator 


We observe that polynomial T(q) encloses the non-causal structure (positive 
power of q ) inherent to the predictive control. 


The interest resulted from the RST representation (actually very general because 
any numerical control law can be modeled this way [LAN 88]) is that, finally, the 
real-time loop proves to take little calculation time as the control applied to the 
system is calculated through a simple difference equation [12.10]. The three 
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polynomials R, S, T are actually elaborated offline and uniquely defined as soon as 
the four adjustment parameters are chosen. 


Consequently, this type of control favors the selection of short sampling periods 
and it proves to be well-adapted to the control of fast electro-mechanical systems 
(machine-tool, high-speed machining, etc.). 


Another major interest in the RST structure pertains to the study of stability of 
the corrected loop and thus the characterization of stability of the elaborated 
predictive control, which is from that moment on possible for a set of parameters of 
the fixed criterion. This study is examined in the following section. 


12.2.2. Automatic synthesis of adjustment parameters 


The definition of the quadratic criterion [12.6] showed that the user must set four 
adjustment parameters. However, this choice of parameters proves to be difficult for 
a person who is not a specialist because there are no empirical relations which make 
it possible to relate these parameters to traditional “indicators” in control such as 
stability margins or a bandwidth. 


Based on the study of a great number of single-variable systems, it is however 
possible to issue some “rules” based on the traditional criteria of stability and 
robustness [BOU 92] that we summarize. 


12.2.2.1. Criterion of stability and robustness 


First of all, the objectives of stability are related to the study in Bode, Black or 
Nyquist planes of the transfer function of the open loop corrected by the predictive 
regulator: 


ga BITOR) 
AHSA 





Hp (q "= [12.11] 


It is generally agreed that a “good” adjustment is characterized by: 
— a phase margin Ag higher to 45°; 


— a minimal gain margin AG from 6 to 8 dB (decibels). 
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The objectives of robustness are linked to the calculation of the delay margin 
AT = Ag/o, (Ag in rad, œ, gap angular frequency at 0 dB) [12.12] 


to the study, in the scalar plane, of the direct sensitivity functions øg and 
complementary sensitivity functions ©. : 


-1 -1 -1 
ee Alq )S(q )A(G ) [12.13] 


Aq” ')S(q" Aq!) +q” BAHRA”) 








q | B(q')R(q"') 


OO ee II i ee [12.14] 
Aq Sla )A(g )+q Bq )R(q ) 





It is generally agreed that a “good” adjustment is characterized by: 
—a delay margin higher than a sampling period; 
—a direct sensitivity function of a module lower than 6 dB; 


— a complementary sensitivity function of a module lower than 3 dB. 


12.2.2.2. Selection procedure of the criterion parameters 


From the criteria formulated above with the help of the traditional tools of scalar 
Automation, it is possible to choose the sets of satisfactory adjustment parameters: 


— N,: prediction horizon lower on the output. The product N T, (T, sampling 
period ) is chosen as equal to the pure delay of the system; 


— N,: prediction horizon higher on the output. The product N, T, is limited by 
the value of the response time. The bigger N, is, the more stable and slower the 
corrected system becomes; 


— N, : prediction horizon on the control. Choosing N,, equal to 1 simplifies the 
calculation and does not penalize the stability margins (on the contrary, a higher 
value tends to decompose the phase margin); 


— A: weighting coefficient on the control. This parameter is related to the gain of 
the system, through the empirical relation: 


A 


opt =tr(G'G) (G matrix described in 12.2.1) [12.15] 
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The choice of parameters is frequently limited to a bi-dimensional search ( N3 
and À ) ending with the selection of a “good” adjustment. 


12.2.3. Extension of the basic version 

Based on the preceding easy version, several derived strategies were developed, 
which made it possible to recognize: 

— closed loop pre-specified dynamics (structure of multiple reference models); 

— several variables to control (cascade structure); 


— constraints imposed on the input and output signals. 


12.2.3.1. Structure of multiple reference models 


The aim of this predictive structure of multiple reference models is double. 
Firstly, it makes it possible to impose a reference trajectory through a stable pursuit 
of a model determined by the user who tones down the conformity with the setting. 
This pursuit model imposes the dynamics of the looped system (input/output 
behavior) and it may be considered as a pole placement. 


It is also a matter of weakening the quick control variations that we can 
sometimes recognize through the preceding algorithm, by trying to recreate the 
reasonable reference control that must be applied to the system in order to obtain, at 
the output, the reference trajectory and by creating in the criterion a minimization on 
the control error and not only on the control. 


The digital model of prediction is defined here again as CARIMA: 


Ag) y(0)= Bg u(t 1) + SO [12.16] 


The pursuit model chosen by the user makes it possible to specify the reference 
trajectory y,(f) that the output of the system will have to follow: 


4T Dy O = qB (TWO [12.17] 


where: B,(q7™') = B(q”')P(q"'). 
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Pq’) is conceived in such a way as to insure the asymptotical behavior: 
y, (ce) = w(æ) Thus, for a step function setting, we can choose: 


ZUE Cte 


Paq’ = T 


A(q') is generally a second degree polynomial making it possible to impose a 
desired response time as well as an adapted damping coefficient. 


Coupled to the reference trajectory y,(f), a reference control u,(¢), which is 
allowed by the system, is equally defined, the two trajectories being related by the 
relation: 


ATY, O =q BT u, (t) [12.18] 


In order to avoid the reverse of the model and the stability problems related to 
polynomial B(q |) that may result, equation [12.18] can be formulated again based 
on relation [12.17] by: 


ATu, © = AG PQ wo) [12.19] 


Figure 12.4 sums up the principle of this structure with reference models [IRV 
86]. 
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Figure 12.4. Principle of GPC/MRM algorithm 
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The cost function is henceforth a weighted sum affecting the squares of the 
output predicted errors and the squares of the future control error increments: 


N. N, 
J= Yeres +a deze+j-v [12.20] 
J=N, j=l 


t+j)= D} +j)-y,(C+)j 
with: ¢,(¢+ j)=0forj > N, and: Eyl 2 À D yr I) = 
Eu (t+ j) = Au(t+ j)— Au, (t+ j) 


Based on relations [12.16], [12.17] and [12.18], we notice that, on the one hand, 
the increments of control errors and, on the other hand, the output errors are linked 
by the relation: 


A(q7' E (0 =B )e, ¢-D +E) [12.21] 


which corresponds exactly to the CARIMA structure [12.16], parameterized again in 
terms of signals pertaining to input/output errors. The entire theory previously 
developed in the case of the GPC “traditional” algorithm can be preserved by 
replacing in the minimization process y by €,, Au by €, and w by 0 (the system 
must indeed follow a zero error setting). From this moment on, the minimization of 
the quadratic criterion [12.20] reaches the optimal sequence: 


Eu opt =—Nif ey) +ib E, oy (t-D] [12.22] 


with: Eu opt = [ Eu opt “+ E+N, Top | 


Here again, only the first value of the sequence, equation [12.22], is applied to 
the system, according to the principle of sliding horizon: 


En opt (Ò = -nt lit E, (E) +ih €, oy (t -1)] [12.23] 


We infer from it the equivalent polynomial regulator of this restated problem in 
terms of error signals: 


STD En op (0 = -RD EA) [12.24] 
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S(q') = (l+n} ih q’) degree [sa] = degree [Ba] 
with: 
R(q |) = ni if degree EUR] = degree [aa] 

The control applied to the system is inferred from the difference equation: 
SG DAUA) = RQ O+T,. (QG w(t) [12.25] 


With, based on relations [12.17] and [12.19]: 


Pq) 
—] 
rq ) 





Tq) = Aq Aq) Sq7')+q BAHRAT) [12.26] 


This control law is based again on an RST structure, with the same polynomials 
R(q |) and S(q |) as those obtained through the traditional algorithm; only 
polynomial T(q) is modified, becoming a causal rational fraction and explicitly 
considering the pursuit model chosen by the user. Furthermore, the calculation of the 
input/output closed loop makes it possible to verify that the resulting dynamics is 
defined by the pursuit model, which is not at all the case of the transfer function 
between the output and the interference. 


12.2.3.2. Cascade structure 


The cascade structure suggested makes it possible, in the case of a two-loop 
version, to simultaneously control two variables (for instance speed and position, for 
the regulation of the electro-mechanical systems). In the internal loop it includes a 
predictive structure with multiple reference models developed above, paired to a 
GPC traditional algorithm for the external loop, as indicated in Figure 12.5 


The synthesis of the regulator of the internal loop is considered according to the 
GPC/MRM strategy of the previous section, in such a way that the internal regulator 
Ry, S and 7, is finally implemented by the following difference equation: 


Sy(q7' )Au(t) = -RTA + PQ | wa (0) [12.27] 


The predictive model used for the synthesis of the external regulator consists of 
two terms: on the one hand the model corresponding to the asymptotical behavior of 
the closed internal loop and on the other hand the model issued from the external 
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system (defined by the polynomials 4 (q7!) and B,(q |) ), according to the 
relation: 








MORR Birla ') 4 Bola ) Big ) [12.28] 
wO ALT) 41) Al) 
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Figure 12.5. Predictive cascade structure 


In order to obtain this cascade structure [BOU 91], a GPC “traditional” algorithm 
is perfected, obtaining the external regulator as an equivalent RST polynomial form, 
this regulator is also implemented with the help of a second difference equation: 


STHA mA = -R TAT aw CA [12.29] 


The global implementation of this two-loop structure thus requires the 
programming of two difference equations, with weaker degree polynomials and 
therefore a short calculation time for the algorithm. The fact that this structure can 
be generalized to any number of loops proves that it is really adapted to the fast real- 
time loops, like the single-loop structures considered previously. It is important to 
indicate that only a “single rhythm” cascade has been presented so far (i.e. with one 
and the same sampling period for the two loops). Other cascade predictive structures 
were developed, either “multi-rhythm” (different sampling periods between loops), 
or by using operator 5 when over-sampling problems appear. The reader can refer to 
[BOU 95] and [BOU 96] for more details. 
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12.2.3.3. Recognition of equality type terminal constraints (CRHPC) 


A number of various physical problems require the recognition of constraints on 
the output signal as well as on the control. These constraints may be of “equality” 
type, when we try to impose precise values to the signals considered, or of 
“inequality” type, making it possible to define saturations or ranges of specific 
variations for a signal. The aim of this section is to examine the recognition of 
“equality” type constraints within the GPC algorithm. 


Imposing “equality” type constraints requires the minimization of the cost 
criterion GPC subjected to a set of m equality constraints in the future, called 
terminal constraints, defined by: 


t+ Na +j)= wt+No) for j=1,..., m [12.30] 


where m is the number of points for which the predicted output must coincide to 
the setting w after the higher prediction horizon N3 . This strategy was developed in 
a version called CRHPC (Constrained Receding Horizon Predictive Control) [CLA 
91]. 


Equation [12.30] may be transposed in the following matrix form: 








Gu=w-f [12.31] 
EN,+1 EN, 7 &EN,-N,+2 
; EN, + £ 7 EN,-N,+ 
with: G.= N 2 N, +1 N, Ny 3 
EN,+m &EN,+m-1 `“ EN,-N,+m+l 





We =[w(t + Np) ye w(t + N3) 


f, =[f-(+ No +1),..., falt + No +m) 


f. represents the free response of the system under constraint, defined similarly 
for the free response of the non-constraint system (equation [12.6]), by: 


fe =if, y(O+ihe Au(t-1) [12.32] 


388 Analysis and Control of Linear Systems 


with: 


E at 
if, = BAC a Ps (G J 


f -1 AT 
ihe =| Hy, Joss HN, +m(q )] 


The algorithm CRHPC consists of a GPC traditional algorithm related to the 
concept of terminal constraints. Based on the numerical model of the system 
(equation [12.1]), of the optimal predictor (equation [12.5]), of the quadratic 
criterion (equation [12.6]) and of the terminal constraints (equation [12.30]) and 
with the help of Lagrange multiplier factors, the optimal solution of the problem 
(equation [12.6]) under the constraints (equation [12.30]) is obtained in matrix form: 





ü on =2 [HG] (GAGI IGA -H'] G" e+ [12.33] 
+H Ge (G-H7 Ge) b 


with: H=2G'G+Aly ) 


c =(w-if y(t)-ih Au(t—1)) 
b =(if, y()+ih, Au(t—1)) 


All the vectors and matrices defined above are made up of coefficients 
intervening in the j- interval predictors (equation [12.5]) for M Sj <No +m. 


With an approach similar to the one adopted for the simple predictive structure, 


only the first value of the previous sequence is applied to the system, according to 
the principle of the sliding horizon: 


Aulop:(t)=my b+nj € [12.34] 


with: m} firstrowof (HIGI (G.H Gt) 
ny. first row of [2 (H! -H 'Gl(G,.H 'G2)'G.H') GT) 
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Equation [12.34] corresponds to a linear corrector that can be written in an RST 
form, totally similar to the one obtained without constraint: 


ST DAUE) =-RQ) (0) + TQ [12.35] 
The three polynomials R, S, T take the following form: 


Sq!) =1+q!(nTih +mTih,) 
Rq) =n] if +m} if, [12.36] 


N N: Tr N: N» -T 
TA =n lq + PF + 9%] 


We can find in this structure a basic part resulting from the algorithm without 
traditional constraint, with elements corresponding to the recognition of constraints. 


The fundamental advantage of this version under CRHPC constraints is to ensure 
the stability of the looped system for particular choices of adjustment parameters 
[LEV 93, NIC 93]: 


N, = pure delay of the system 
N, 2 degree | Ag) ]+2 

N, = N,+1 

m = degree [aa |+ 

À > 0 


The recognition of terminal constraints was developed here only in the context of 
a “traditional” structure of the GPC algorithm. This formalism can also be 
introduced in the single- or multi-rhythm cascade structures with operator 6 .. A 
unified version including the equality and inequality constraints was perfected, 
leading to the elaboration of GPC regulators by non-linear quadratic optimization. 
The reader can refer to [DUM 98]. 


12.3. Functional predictive control (FPC) 
This second structure of predictive control is introduced here by indicating the 


big ideas of the method, starting with the form of the model, the quadratic criterion 
and up to the examination of adjustment parameters. The formalism and the 
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calculation necessary to the analytical minimization of the criterion will not be dealt 
with so that the presentation does not become too difficult. The reader may refer to 
[COM 94, RIC 87] for minimization details of a simple structure and [RIC 93] in the 
case of a cascade structure. 


12.3.1. Definition of numerical model 


As in the case of GPC, there is no restriction for the form of the model, but the 
approach by state variable representation is preferred. The first versions of FPC used 
even a representation by convolution with the help of the coefficients of the discrete 
impulse response. 


If we note by u(t) and s(t) respectively the input and output of the model, this 
model is traditionally represented by the system of equations: 


ee [12.37] 


Sm(t) =H Xm(t) 


12.3.2. Choice of a reference trajectory 


The reference trajectory, initialized on the output of the process at instant f, 
specifies the way in which we want the process to relate to the setting on a given 
prediction horizon. All choices are possible but the easiest is to consider a first order 
dynamics for the variance between the setting and the reference trajectory. If we 
note by s,(¢) the output of the system and w(t) the setting to be followed, this 
reference trajectory s p(t) is then defined by the relation: 


[wit -sre liw- so] [12.38] 
where 0 <œ <1 is a parameter that conditions the speed of the conformity desired: 
3T 


e 
T, 


r 





a = exp(-—) [12.39] 


with T, the sampling period and T, the response time in closed loop. 
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12.3.3. Object-model difference 
FPC makes it possible to adjust the prediction of the process output obtained 
through the model, by taking into account the modeling and identification errors as 


well as possible interferences. Hence, we introduce a signal measuring the variance 
between the system and the model [SAN 94], called omd (object/model difference): 


omd(t) = s(t) — S,,(t) [12.40] 


Hence the aim is to provide a future prediction of this variance omd (t+ j), so 
that we have: 


S (t+ j) =8,,(t+ j) + omd(t + j) [12.41] 
The “level” prediction that consists of considering: 
omd(t + j) = omd(t) = Sy (t)-s,,(t) [12.42] 


corresponds to a case of a 0 degree self-compensator capable of blocking a static 
variance. In general, the self-compensator is written: 


d, 
omd(t+ j)=5,()-5, 0+ J O [12.43] 
i=l 


where d, is the degree of the extrapolator. 


12.3.4. Structure of the future control 


In an original way, compared to other predictive techniques, the future control is 
structured here in the form of a linear combination of preliminarily chosen functions, 
called “basic functions” and marked {uj £},k=1---n, : 


u(t+ j)= Y urup Cj) [12.44] 
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Hence, the calculation of the future control sequence requires the determination, 
at every instant ¢, of the unknown coefficients {44},k =1---n, . Furthermore, based 
on the strategy of sliding horizon, only the first value of the sequence is applied, 
which requires that the choice of basic functions must be done in such a way that at 
least one function verifies u,,(0) #0. 


Traditionally, the basic functions are canonical functions (step function, ramp, 
parabola), which are chosen according to the type of the setting and the integrator 
character of the process. Table 12.1 provides the value of error w—s,, in the case of 
a non-integrator system. 











Functions Step function Step function + Step function + 
Input ramp ramp + parabola 
Step function 0 0 0 
Ramp Cte 4 0 0 0 
Parabola co Cte # 0 0 

















Table 12.1. Error w— Sp for a non-integrator system 


12.3.5. Structure of the optimal predictor 


The predicted output s,,(¢+j/t) is traditionally decomposed into a loose 
response and forced response: 


Sm(tt+j/D=Sy t+ )t+smp (tts) [12.45] 
with, taking into account the basic functions and the state model: 
My 
t+j)= t j 
Smr (+) 2H Smok CJ) [12.46] 


SmL(t+j)=HF/x, (0) 


Smbk representing the forced response of the model at input wp; . 
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If an input/output polynomial structure is chosen, the optimal predictor will have 
a similar form to the one developed in the case of GPC equation [12.5] [DUM 92]. 


12.3.6. Definition of quadratic criterion, concept of match points 


The FPC control law is obtained by minimization of a quadratic criterion 
pertaining to the future errors with a weighting term on the control: 


Ny 
D=>°15, (0+ hj)-sp@+hj)] 24A PA [12.47] 
j=l 


Based on equations [12.41], [12.43] and [12.45], the criterion thus chosen 
minimizes the variance between the output of the predicted process and the 
reference trajectory in a certain number of points called match points: let h; be 
these points and n, their number. The approach followed during the minimization 
of the criterion, before reaching the future controls structured by relation [12.44], is 
summed up in Figure 12.6. 


As for the generalized predictive control, the minimization (not detailed here) of 
the preceding criterion leads to an equivalent polynomial regulator in RST form 
represented in Figure 12.3. 







Reference Setting w 
trajectory | 
Match points — 
Prédicted output Sp 
Output Sp 


Future|control 


IL, | 


Past Future 
t t+h 











Figure 12.6. Reference trajectory and match points 
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12.3.7. Adjustment parameters 


Based on the preceding theoretical developments, it appears that the 
implementation of a functional predictive control law involves the choice of the 
following parameters: 


— T,.: desired response time. This parameter is used to indicate the sampling 
time if necessary. It also intervenes in the definition of coefficient @ of the reference 
trajectory; 


— n, andu; + : the number of basic functions and their nature. These parameters 
are set as soon as the nature of the setting signal and the integrator type of the 
process are known; 


— n andh;: the number of match points and their place. The mathematical 
resolution requires a number of equations more than or equal to the number of 
unknown factors; traditionally, we choose ny =n, and the match points are placed 
within the prediction horizon, which is limited by the response time desired. The 
more a point will be placed at the beginning of the horizon, the faster the system will 
most probably be; 


— A: weighting coefficient on the control. This parameter is related to the gain of 
the system have through a relation similar to the one defined in GPC. 


The choice of parameters proves to have, in the single-variable case, a 
complexity equal to that observed in the case of a generalized predictive control 
structure, as only the match points (often restricted to one) and the weighting 
coefficient on the control are to be fixed, equivalent to horizon N, and to 
coefficient À of GPC. 


12.4. Conclusion 


Predictive methods considered in the preceding sections showed the simplicity 
of their design and implementation because they are always translated, irrespective 
of the versions considered (simple or cascade), outside inequality constraints, by the 
real-time programming of several difference equations, which are generated from 
the RST polynomial structure of the equivalent regulator. This fundamental 
characteristic implies very fast real-time loops (since the polynomials are actually of 
low degrees, very few online operations prove to be necessary) because all the 
calculation of the synthesis phase are made off loop as soon as the adjustment 
parameters are chosen. This control structure is indicated for applications for which 
the specifications in terms of sampling period are more and more severe (high speed 
machining in machine tools, for example). 
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Parallel to this simple implementation, predictive techniques make it possible to 
satisfy the very strict specifications, in terms of stability, speed and precision (static 
or dynamic) but also in terms of robustness, with respect to interferences or 
neglected dynamics. This level of performance, that “traditional” controls cannot 
achieve is, however, very complex in terms of the choice of adjustment parameters, 
not only by a higher number, but also due to a stronger interaction between these 
parameters. This is why it appears more and more necessary to design, apart from 
the traditional synthesis of the regulator, a support module with adjustment 
parameters, which will make the implementation as transparent as possible for the 
user who is not always a specialist in advanced control laws. The design of a support 
module is more and more conceivable starting with traditional tools of Automation 
for the study and analysis of stability and robustness. 


All the advantages listed above — simplicity, performance, etc. — ensure that 
these predictive techniques are implemented in various industrial applications, in 
very different fields, but preferably when the trajectory to be followed in the future 
is already known, in a way to profit entirely from the anticipative aspect of this 
control law. 


Hence, a privileged field of predictive control is robotics and machine-tool, for 
which the elaborated versions (single or multi-rhythm cascades, with operator 
5 under constraints, etc.) make it possible to deal with sampling periods that can be 
very strict, to cover a very large range of problems that are already known and to 
slightly improve the outcomes that have been accessible so far [BOU 87, RIC 87]. 
however, these techniques were also implemented on slow processes, as the thermal 
systems [CLA 88], the problems of monitoring the temperature of buildings [DUM 
98], or the food industry [DUM 98]. 


Finally, it should be noted that all these structures developed in the context of 
single-variable systems can be generalized to multi-variable systems without any 
particular theoretical difficulty [BOU 96]. The influence of adjustment parameters 
becomes, however, more complex since the scalar study of the stability and 
robustness require the use of the techniques obtained from the -analysis with the 
concept of structured and non-structured uncertainties [BOU 99, MOH 92]. One of 
the multi-variable applications of GPC pertains, for example, to the torque-flow 
control of asynchronous machines for which couplings are very important and the 
operation is non-linear. 


The performances of predictive laws also open the possibility of implementing 
these techniques to adaptive structures, which make it possible to maintain an 
“optimal” behavior when the system presents parametrical drifts in time. The 
perspectives in this field prove to be very interesting because the principles of direct 
predictive adaptive control laws (for which the parameters of the regulator, 


396 Analysis and Control of Linear Systems 


presented in RST polynomial form, are updated directly in real-time and in a single 
step) are added to indirect adaptive predictive versions henceforth traditional. These 
methods do not impose the real-time calculation of the predictors required for the 
creation of the regulator and leave the hope of a gain in time for the significant 
calculation. 


12.5. Bibliography 


[BIT 90] BITMEAD R.R., GEVERS M., WERTZ V., Adaptive Optimal Control. The 
Thinking Man’s GPC, Prentice Hall International, Systems and Control Engineering, 
1990. 


[BOU 91] BOUCHER P., DUMUR D., DAUMULLER S., “Predictive Cascade Control of 
Machine Tools Motor Drives”, Proceedings EPE'91, vol. 2, p. 120-125, Florence, 
September 1991. 


[BOU 92] BOUCHER P., DUMUR D., DAUMULLER S., “Autotuned Predictive Control”, 
Proceedings IFAC Workshop MICC’92, p. 209-213, Prague, September 1992. 


[BOU 95] BOUCHER P., DUMUR D. “Predictive Motion Control”, Journal of Systems 
Engineering. Special Issue on Motion Control Systems, vol. 5, p. 148-162, Springer- 
Verlag, London, 1995. 


[BOU 96] BOUCHER P., DUMUR D., La Commande Prédictive, Méthodes et Pratiques de 
P Ingénieur collection, Editions Technip, Paris, 1996. 


[BOU 99] BOUCHER P., DUMUR D., ROUGEBIEF C., RAGUENAUD P., “Predictive 
Multivariable Generalised Predictive control for Cognac distillation”, 5" European 
Control Conference, F203, Karlsruhe, September 1999. 


[CLA 87a] CLARKE D.W., MOHTADI C., TUFFS P.S., “Generalized Predictive Control, 
Part I “The Basic Algorithm”, Part II “Extensions and Interpretation”, Automatica, 
vol. 23-2, p. 137-160, mars, 1987. 


[CLA 87b] CLARKE D.W., MOHTADI C., TUFFS P.S., “Properties of Generalized 
Predictive Control”, Proceedings 1 0" World Congress IFAC’87, vol. 9, p. 63-74, Munich, 
July 1987. 


[CLA 88] CLARKE D.W., “Application of Generalized Predictive Control to Industrial 
Processes”, IEEE Control Systems Magazine, p. 49-55, April 1988. 


[CLA 91] CLARKE D.W., SCATOLLINI R., “Constrained Receding Horizon Predictive 
Control”, Proceedings IEE-D, vol. 138, p. 347-354, 1991. 


[COM 94] COMPAS J.M., ESTIVAL J.L., FULGET N., MARTIN R., RICHALET J., 
“Industrial Applications of Predictive Functional Control”, Proceedings 3" IEEE 
Conference on Control Applications, vol. 3, p. 1643-1655, Glasgow, August 1994. 


[DUM 92] DUMUR D., CHÊNE A., LAFABREGUE E., BOUCHER P., “Polynomial 
Predictive Functional Controller for a.c. Motors”, Proceedings IFAC Workshop MCIA’92, 
p. 165-170, Perugia, October 1992. 








Predictive Control 397 


[DUM 98] DUMUR D., BOUCHER P., “A Review Introduction to Linear GPC and 
Applications”, Journal A, vol. 39, no. 4, p. 21-35, December 1998. 


[FAV 88] FAVIER G., DUBOIS D., ROUGERIE C., “A Review of K-Step Ahead 
Predictors”, Proceedings IFAC'88 Identification and System Parameter Estimation, 
Beijing, August 1988. 


[IRV 86] IRVING, E., FALINOWER C.M., FONTE C., “Adaptive Generalized Predictive 
Control with Multiple Reference Model”, Proceedings 2" ACASP/86, June 1986. 


[LAN 88] LANDAU I.D., Identification et commande des systèmes, Hermès, 1988. 


[LEV 93] LEVA A., SCATOLLINI R., “Predictive Control with Terminal Constraints”, 
Proceedings pe European Control Conference, vol. 2, p. 932-936, Groningen, June, 
1993. 


[MOH 86] MOHTADI C., CLARKE D.W., “Generalized Predictive Control, LQ, or Pole 
Placement: A United Approach”, Proceedings 25 Conference on Decision and Control, 
Athens, December, 1986. 


[MOH 92] MOHTADI C., SHAH S.L., FISHER D.G., “Frequency response characteristics of 
MIMO GPC”, International Journal of Control, vol. 55-4, p. 877-900, 1992. 


[NIC 93] DE NICOLAO G., SCATOLLINI R., “Stability and Output Terminal Constraints in 
Predictive Control”, Advances in Model-Based Predictive Control, p. 105-121, Oxford 
Science Publications, Oxford University Press, 1993. 


[RIC 78] RICHALET J., RAULT A., TESTUD J.L., PAPON J., “Model Predictive Heuristic 
Control: Applications to Industrial Processes”, Automatica, vol. 14, 1978. 


[RIC 87] RICHALET J., ABU EL ATA S., ARBER C., KUNTZE M.B., JACUBASCH A., 
SCHILL W., “Predictive Functional Control. Application to Fast and Accurate Robots”, 
Proceedings 10 IFAC World Congress, Munich, July, 1987. 


[RIC 93] RICHALET J., Pratique de la Commande Prédictive, Hermès, 1993. 


[SAN 94] SANZO M., RICHALET J., PRADA C., “Matching the Uncertainty of the Model 
Given by Global Identification Techniques to the Robustness of Model-Based Predictive 
Controller”, Advances in Model-Based Predictive Control, p. 386-401, Oxford Science 
Publications, Oxford University Press, 1994. 


[WER 87] WERTZ V., GOREZ R., ZHU K.Y., “A New Generalized Predictive Controller 
Application to the Control of Process with Uncertain Dead-Time”, Proceedings of the 26" 
Conference on Decision and Control, p. 2168-2173, Los Angeles, December 1987. 


This page intentionally left blank 


Chapter 13 


Methodology of the State Approach Control 


Designing the “autopilot” of a multivariable process, be it quasi-linear, 
represents a delicate thing. If the theoretical and algorithmic tools concerning the 
analysis and control of multivariable linear systems have largely progressed during 
the last 40 years, designing a control law is left to the specialist. The best engineer 
still has difficulties in applying his knowledge related to multivariable control 
acquired during his automation course. It is not a mater here to question the interest 
and importance of automation in the curriculum of an engineer but to stress the 
importance of “methodology”. The teaching of a “control methodology”, coherently 
reuniting the various fundamental automation concepts, is the sine qua non 
condition of a fertile transfer of knowledge from laboratories toward industry. 


The methodological challenge has been underestimated for a long time. How 
else can we explain the little research effort in this field? It is, however, important to 
underline among others (and in France) the efforts of de Larminat [LAR 93], 
Bourlés [BOU 92], Duke [DUC 99], Bergeon [PRE 95] or Magni [MAG 87] 
pertaining to multivariable control methodology. 


This chapter deals with a state-based control methodology which is largely 
inspired by the “standard state control” suggested by de Larminat [LAR 00]. 
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13.1. Introduction 


Controlling a process means using the methods available for it in order to adjust 
its behavior to what is needed. The control applied in time uses information 
(provided by the sensors) concerning the state of the process to react to any 
unforeseen evolution. Designing even a little sophisticated control law requires the 
data of a behavior model of the process but also relevant information on its 
environment. Which types of disturbances are likely to move the trajectory of the 
process away from the desired trajectory and which is the information available a 
priori on the desired trajectory? 


Finally, a method of designing control laws must make it possible to arbitrate 
among various requirements: 


— dynamic performances (which must be even better when the transitional 
variances between the magnitudes to be controlled and the related settings are 
weak); 


— static performances (which must be even better when the established variances 
between the magnitudes to be controlled and the related settings are weak); 


— weak stress on the control, low sensitivity to measurement noises (to prevent a 
premature wear and the saturation of the actuators, but to also limit the necessary 
energy and thus the associated cost); 


— robustness (qualitatively invariant preceding properties despite the model 
errors). 


Although this last requirement is not intrinsic (it depends on the model retained 
for the design), it deserves nevertheless to be discussed. It translates the following 
important fact. Since the control law is inferred from models whose validity is 
limited (certain parameters are not well known, idealization by preoccupation with 
simplicity), it will have to be robust in the sense that the good properties of control 
(in term of performances and stress on the control) apply to the process as well as to 
the model and this despite behavior variations. 


This need for arbitrating between various control requirements leads to two 
types of reflection. 


It is utopian to suppose that detailed specifications of these requirements can be 
formalized independently of the design approach of the control law. In practice, the 
designer is very often unaware of what he can expect of the process and an efficient 
control methodology will have as a primary role to help him become aware of the 
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attainable limits. The problem of robustness can also be considered in two ways!. In 
the first instance, modeling uncertainties are assumed to be quantified in the worst 
case and we seek to directly obtain a regulator guaranteeing the expected 
performances despite these uncertainties. At their origin, the H. control [FRA 87] 
and the u-synthesis [DOY 82, SAF 82, ZHO 96] pursued this goal. A more realistic 
version consists of preferring a two-time approach alternating the synthesis of a 
corrector and the analysis of the properties which it provides to the controlled 
system. Hence, the methodology presented in this chapter will define a limited 
number of adjustment parameters with decoupled effects, so as to efficiently 
manage the various control compromises. 


How can the various control compromises be better negotiated than by defining a 
criterion formalizing the satisfaction degree of the control considered? The 
compromise would be obtained by optimizing this criterion after weighting each 
requirement. Weightings would then play the part of adjustment parameters. A priori 
very tempting, this approach faces the difficulties of optimizing the control 
objectives and the risks of an excess of weightings which may make the approach 
vain. It is important in this case to define a standard construction procedure of the 
criterion based on meta-parameters from which the weightings will be obtained. 
These meta-parameters will be the adjustment parameters. 


The methodology proposed here falls under the previously defined principles, i.e. 
it proceeds by minimization of the judiciously selected standard of functional 
calculus. When we think of optimal control, we initially think? of control H, or 
Hœ. We will prefer working in Hardy’s space H, (see section 13.2) for the 
following reasons: 


— the criterion, expressed by means of H, standard ( H, is a Hilbert space), can 
break up as the sum of elementary criteria; 


— control H, has a very fertile reinterpretation in terms of LQG control which 
was the subject of many research works in the past whose results can be used with 
benefit (robustness of LQ control, principle of separation, etc.); 


— the principle of the “worst case” inherent to control H is not necessarily best 
adapted to the principle of arbitration between various requirements. In addition, and 
even if the algorithmic tools for the resolution of the problem of standard H. 

optimization operates in the state space, the philosophy of the H. approach is 
based more on an “input-output” principle than on the concept of state. 





1 In [CHE 93] we used to talk of direct methods versus iterative methods. 
2 For linear stationary systems. 
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In fact, the biggest difficulty is not in the choice of the standard used (working 
in Hœ would be possible) but in the definition of the functional calculus to 
minimize. This functional calculus must standardize the various control 
requirements and be possible to parameterize based on a reduced number of 
coefficients. In the context of controls H) or H,,, it is obtained from the 
construction of a standard control model. This model includes not only the model of 
the process but also information on its environment (type and direction of input of 
disturbances, type of settings) and on the control objectives (magnitudes to be 
controlled, weightings). The principle of its construction is the essence of the 
methodology presented in this chapter. The resolution of the optimization problem 
finally obtained requires to remove certain generally allowed assumptions within the 
framework of the optimization problem of standard H3. 


In short, the methodological principles which underline the developments of this 
chapter are as follows: 


— to concentrate on an optimization problem so as to arbitrate between the 
various control requirements; 


— to privilege an iterative approach alternating the design of a corrector starting 
from the adjustment of a reduced number of parameters up to the decoupled effects 
and the analysis of the controlled system; 


— to express the control law based on intermediate variables having an identified 
physical direction and thus to privilege the state approach and the application of the 
separation principle in its development. The control will be obtained from the 
instantaneous state of the process and its environment. 


This chapter is organized as follows. Section 13.2 presents the significant 
theoretical results relative to the H, control and optimization and carries out 
certain preliminary methodological choices. The minimal information necessary to 
develop a competitive control law is listed in section 13.3 before being used in 
section 13.4 for the construction of the standard control model. The methodological 
approach is summarized in this same section and precedes the conclusion. 


13.2. H, control 


The traditional results pertaining to the design of regulators by H, optimization 
and certain extensions are given in this chapter. Its aim is not to be exhaustive but to 
introduce all the notions and concepts which will be useful to understand the 
methodology suggested later on. 
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13.2.1. Standards 


13.2.1.1. Signal standard 


Let us consider the space Z% of the square integrable signals on [0,0 Í, with 
value in R”. We can define in this space (which is a Hilbert space) the scalar 
product and the standard? defined below: 


1/ 
+00 +00 2 


(x)= [oa p=] f O ar [13.1] 
0 0 


The Laplace transform TL() makes the Hardy space H7 of analytical functions 
X(s) in Re(s)>0 and of integrable square correspond to L5. Parseval’s theorem 
makes it possible to connect the standard of a temporal signal of Z5 to the standard 
of its Laplace transform in H7 : 


tos # 
Ih =[Xl 5f Grp x oxo do [13.2] 


13.2.1.2. Standard induced on the systems 


Let us consider the multivariable system defined by the proper and stable 
(rational) transfer matrix G(s)or alternatively by its impulse response 
g0=TL 0. 








3 Standard whose physical importance in terms of energy is obvious. 
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The “H, standard” of the input-output operator associated with this system is 
defined, when it exists, by: 


‘CO y 

f 
1 x 

lel, =| == f Craon gogoa [13.3] 


Let us note that u(t)e R™ and y(t)€ RP respectively the input and output of the 


system at moment ¢ Let R,,(t), R,,() be the autocorrelation matrices and 
Siw (jO), Syy (jO) the associated spectral density matrices. We recall that these 
matrices are defined as follows. For a given u signal we have: 
T 
i 1 . 
Ry (tT) = lim af Í ult+T) ul (t) dt. For a centered random u signal, whose 
T 


+00 


certain stochastic characteristics (in particular its 2 order momentum) are known, 
Rut) could be also defined by the equality: R,,,,(t)= Elut+t)u' ()). The two 
definitions are reunited in the case of a random signal having stationarity and 
ergodicity properties [PIC 77]. In addition we have the relation: 
+co 

Suu (J@) = [Ru (eat. These notations enable us to give various 
interpretations to the H, standard of G. The results of Table 13.1 are easily 
obtained from Parseval’s equality or the theorem of interferences [PIC 77, ROU 92]. 
They make it possible to conclude that lc| 2 is also the energy of the output signal 
in response to a Dirac impulse or that it characterizes the capacity of the system to 
transmit a white noise*. These interpretations will be important further on. 





4. Characterized by a unitary spectral density matrix. 
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Characteristic of the input signal lcl, Significance 





u(t) = òC) lch =k = Isl, 





[Gf = E| POP |= graph R0) 
u(-) is of zero mean 
u(-) / 


RO = 1,50 = [ graph (SGo) do 


—co 














Table 13.1. Several interpretations of ||G||2 


13.2.1.3. The grammians’ role in the calculation of the H, standard 


Let us consider the quadruplet 4€ R?” ,B e R°°",Ce R?™ De RPP such 
that: 


G(s)=C(sI- A)! B+D [13.4] 


In other words, the state x(t)e R” of the system È evolves according to: 


E BS a 2. 
Colle “ee IE [13.5] 


The partial grammians associated with this system are defined by: 


t 
GP = Í At ppl AT dr 
0 
[13.6] 


t 
G,(t)= f eATCTCeA dr 
0 


Table 13.2 presents the results emerging from these definitions. 
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Input signal characteristic Significance of grammians 


u(t) = I môt) , xo = 0 





t 
Go) = [ xa) an? dr 
0 





“(I u(-) is of zero mean G; (£) = Ext) xt (t)) 
Ru) = 1h80) 





u(t)=0, x(0) = xo 
xo! G (xo = Í WDT D dt 
0 














Table 13.2. Several interpretations of grammians 


G.(t) and G, (t) are respectively called partial grammians of controllability and 


observability. In fact, CAOIN is directly connected to the minimal “control 
energy” necessary to transfer the system from state x(0)=0 to state x(t)= x; 


T 
[KWA 72]. Basically, u(t) = Boge Mo. IG. (4 ) Lx O0<t<t, is the minimal energy 
control (fe ut = x" leo) x] that changes the state x(-) from x9 =0 to t=0 to 
0 


xı to t=t,. 


There are also the following equivalences: 
— (A, B) is controllable & Vt>0,G.()>0; 
— (C, A) is observables Vt>0,G,(t)>0. 


It is shown without difficulty that G, (t) and G, (t) are solutions of Lyapunov 
differential equations: 


G ()=4G (+6 (DAT + BBE G.(0)=0 
C E C 


G W=ATc (n+e HA+CTC G,(0)= 0 [13.7] 


oO oO o 


The partial grammians can be effectively calculated by integrating this system of 
first order differential equations (see section 13.6.1). 
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The “total” grammians (this qualifier is generally omitted) result from the partial 
grammians by: G.= lim G,(T) and G= lim G,(T). Their existence results 
T -Hoo T— +% 


from the stability of the system. They are the solution of Lyapunov algebraic 
equations obtained by canceling the derivatives G.(t) and Go (t): 


AG, +G,A! + BB’ =0 and AG, +G,A+CTC=0. 
The following important property is therefore inferred. Let G(s) be the transfer 


B 
matrix defined by the presumed minimal realization G(s) : = f a ; 


Then: 
|G(s)|} = Graph (BTGB) = Graph (CG,C™) [13.8] 


Numerically, standard H, of G(s) could be obtained by resolution of an 
Lyapunov algebraic equation obtained from the state matrices A,B,C. Let us note 


that matrix G(s) must be strictly proper for the existence of G(s) b : 


A B 
A last interesting interpretation of standard H, of G(s): = (4 e) is as follows. 


Let Bel, B.2,°::Be,, be the columns of B. Let yr; be the free response of the 
system on the basis of the initial condition xo; = B,;. It is verified then that the 
following identity is true: 


leok = pale +215 +2 Penh [13.9] 


Thus, standard H, gives, for a system whose state vector consists of internal 
variables easy to interpret, an energy indication on its free response for a set of 
initial conditions contained in Im(B). 


13.2.2. H, optimization 


13.2.2.1. Definition of the standard H; problem [DOY 89] 


Any closed loop control can be formulated in the standard form of Figure 13.1. 
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Figure 13.1. Standard feedback diagram 


The quadripole G, also called a standard model, and feedback K are supposed 


to be defined as follows, by using the transfer matrices G(s) and K(s) and their 
realization in the state space: 


x A By B; X 
G(s) Gy (s) l 
G(s) = GE GO] C | Di Dile]z|=|G | Pu Dlw 
21 22 
C | Da D» y) [C2 | Pa Poju 








K(s)= Dg +Cx(sI- Ag)! Bg [13.10] 


NOTE 13.1.— the size of each matrix results from the size of the various signals: 
wE R”! u € R”? xe R',ze RP! ye RP? : 


The closed loop system of input w and output z, noted by Tzw , is obtained: 


Tew (3) = Fi(G(s). K(s))=Gr4(s)+ G12(5)K (st G22(K (5) Gals) 


= Dyp + Cop (sl — App ) By [13.11] 
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with: 


Ay = k +B (I - DxDy p DKC BU -DxDy) "Cr | 
Bg(I- Dy Dg) Cy A +Bx(I- DyDg)'DyDx 


Be P + By(I - DgDn Fg 
i Bk(-DnDK) Da 


Cor = k, + Dig(I- DeDxy) DkC2, DoU-DKDn)" cx] 


= 
Do = bu +D(1-DxDn) Dg Da | 


It has the property of internal stability if and only if the eigenvalues of App are 
all of negative real part. 


The standard H, optimization problem is generally referred to as a problem 
consisting of finding Ky, which ensures: 


— the inner stability of the closed loop system Tzw = F;(G,Ky, ); 


— the minimality of the criterion Jy, (Ky, ) = [Ta | 2° 


13.2.2.2. Resolution of the H, standard optimization problem 


The solution of the problem above is well-known [ZHO 96]. To begin with, let 
us distinguish two elementary cases before presenting the general case. 


The “state feedback” (SF) case: it is the case where y=x. All the state 
components of the standard model are accessible for feedback. 


The “output injection” (OI) case: it is the case where the feedback can act 
independently on each component of the evolution equation. This case occurs during 


the design of an observer. 


In these two cases, there are the following particular standard models: 





Gre(s):=| C | Di Do Grs(s):=| C | Du 0 [13.12] 


410 Analysis and Control of Linear Systems 


The optimum of the H, criterion, in the case of the state feedback, has the 
characteristic that it can be obtained by a static feedback: 


-ip T T 
KREy, (5) = Dx = (Dix) Diy) ' (B, P+ Din" CG) [13.13] 


with: 


P20 (P positive semi-defined) 


T T 


ATp+pA-(PB +C!p yo Tp y l(@Tp+p 1c)+CcTc =0 
2 1 12 12 de 2 12 1I 1 1 


The optimal state feedback thus results from the resolution of this latter second 
order matrix equation, named the Riccati equation, which is, for the closed loop 
system, the Lyapunov equation: 


T f 
A+BKRE,,) P+PC(A+ B KRE, )+ (Ci + Di2KRep, ) (C+ Di2kRe,, )= 0 


P is thus the observability grammian of the looped system and it is deduced with 


2 
the optimum: |. i = |F (Gre Kre, I = graph (B,PB7). For the sake of 














completeness, it is necessary to specify the existence hypotheses of a solution to this 
problem: 
— pair (A, By) must be stabilizable in order to enable the stability of the looped 


system. Let us note, however, that if the inner stability of the looped system is not 
required, the hypothesis according to which the non-stabilizable modes by u are all 
non-controllable by w or unobservable by z is enough. Gain K RE,, Can then be 


determined from the state representation reduced to the only stabilizable states as we 
will see further on; 


—D,, =0 is a condition which generically ensures the strict propriety of T„„ and 


thus the existence of its H standard; 
—D;,, must be of full rank (per columns) to ensure the reversibility of Da Dis 
B 
in the Riccati equation. Similarly, the zero invariants of Gj2(5) : = - i ) must 
1 12 


not be on the imaginary axis. 
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The H, solution, which is optimal in the case of output injection, is obtained 
directly from what precedes by application from the duality principle (see section 
13.6.2). Under the dual assumptions of those stated previously, we obtain: 


T T es 
Kis,, (8) = Dx =— (Cy +BD;1 ) (Dz) Da1) [13.14] 


S20 (Z positive semi-defined) 


with: 
AZ +A" (SCT + B DE ND: D21) (CE 4+Dy,B )+ BB =0 
2 2 
At optimum, |r = k (G , x) = Graph(C Tsc ). The existence hypotheses 
wh lll Is lb Et 




















of a solution to this problem are themselves dual of those of problem (RE). 


The H, solution — which is optimal in the general case, is this time a dynamic 
system of the same size as the standard model. It is obtained from the two preceding 
elementary cases by applying the separation principle [AND 89]: 


A+ BoKrp, + Kis, C2 t Kis, DorKre, | Kis 
PER 2 H2 H2 H2 H2 
Ky | Re, 5 [13.15] 


Moreover: 


2 


| Fees) [=a (6(s)-Kn, (s)) |; =| Fi (Gre (5). Kreg |, + [Fi (G1s(5)- Rasy, |, 


Let us sum up the existence conditions of this solution to the standard H, 
problem (A,B) stabilizable and (C2 A) detectable. 


A- jol B, 
VoE R, and Dj» are of full rank per column. 
CG Dp 
A-jol B 
Voe R, A and D>, are of full rank per row. These hypotheses 
2 21 


are easily understood if it is known that at optimum, the poles of 7,,,(s) tend toward 
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the zeros of transmission of G,7(s) and G;,(s). In addition, the remaining invariant 
zero are non-controllable modes by Bı or non-detectable modes by Cı which 


would be preserved in closed loop. Hence, the absence of infinite zeros or on the 
imaginary axis is imposed. 


Di, =0. 


13.2.3. H, - LOG 


Various interpretations of the H, standard provided in the preceding section 
enable us to establish the link with Kalman theory and LQG control (see Chapter 6). 
If w is a centered, stationary, unit spectrum white noise, and if the standard model is 
that in Figure 13.2 [STE 87], we obtain: 


F-2-2nfrfrote after er ey [tte 


and NA EC wT ]- ee 


T 
y Np W 





e-n [13.16] 


The two elementary cases previously discussed in relation to H, correspond to 
the case of LQ control and the design of the Kalman filter. We have Kzọ =-K REy, 


and the control law by state feedback u=-K;ox minimizes 


T 
Q N t _ 
Jug = lim + Í EOLO A *O | d. In addition, for Lex =Kis, à 
TT à Ne R u(t) 2 
the observer $= AX + Bu + Leg (y = Cx) is precisely the Kalman filter minimizing 


E dae-əP under the hypotheses of evolution noise w, and measurement 


noise w, previously defined. 
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Figure 13.3. LOG structure (state feedback/observer) 


The resulting control law illustrated in Figure 13.3 has the structure of the state 
feedback/observer: 


u = —KL0À 
À = (A-Lpe C2)? + But Lpg- CR) 


t [13.17] 
Kn, (s) = -Kro(sI - A- BKro +KpgC2) Ka 
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Finally, the equivalence between the standard H, problem and LQG problem is 
i T 
obtained for: Q=C]Ci, R=DhD, Ne =C] Di, V =B,B] , W = D3,D1, 
T 
Np = B,D, 


However, for Hy, matrices O,R,N,,V,W and N pcan be officially considered 
weighting matrices. In order to be able to wisely choose these weightings, the 
designer must make use of methodological rules like the ones suggested in section 
13.4. 


13.2.4. H,- LTR 


According to what was said above, the plethoric works (see [CHE 93] and the 
references included) on the LO control, LO with frequency weightings and LQG can 
be useful in the context of H, control. This is true in particular for the results 
relating to robustness. 


It has been known for a long time that the LO control gives to the looped system 
enviable properties of robustness (see Chapter 6 and [SAF 77]) The exteriority of 
the Nyquist place with respect to the Kalman circle guarantees good gain and phase 
margins, as well as good robustness with respect to static non-linearities (criterion of 
the circle [SAF 80]) and a certain type of dynamic uncertainties’. These properties 
are obtained at the beginning of the process. 


u y 
>= )x »-[G(s)| 
O | O £ 


Process 


Observer 
































State feedback 


Figure 13.4. Analysis of robustness of the LOG control 





5 A(s)= (L pu (s)—L, (s))L, (sy! relative uncertainty on the input loop transfer if L, and 


L „ represent the nominal and disturbed loop transfers. 


pu 
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The robustness properties of the LQ control (or of Hagg regulator) can be lost 
in the general case, i.e. when the state of the system is inaccessible. The addition of 
an observer, be it the Kalman observer, in fact modifies the loop transfer 
Luo (s)= Kzo(sl - A)! B, obtained in the case of state feedback. As an example 
we will verify that L, (s) is also the loop transfer of control LOG if we open the 
loop of Figure 13.4 at point ©. Unfortunately, the need for robustness is felt at point 
© and not at point @ (uncertainties due to the actuators). The LTR technique (Loop 
Transfer Recovery according to the Anglo-Saxon terminology [STE 87, MAC 89]) 
consists of choosing for problem H, / LOG , a particular set of weightings, allowing 
the restoration of the loop transfer L, (s) in point © this time. In the diagram of 
Figure 13.5 it appears obvious that this will be at least closely obtained on the only 
condition that the transfer matrix K79(sJ— A+ LrgC2 yo B is small in terms of a 
certain standard. This will be the case for the following particular choice of 
weightings (for the Kalman filter): 


B, =B, Dy, 0 & V=B,B,", W>0, Nz 0 


This result is formalized by the following proposition. 


Proposition (primal LTR) 
Bi =B>, Dy, 90 = Lpg minimizes |Krolst -A+ Lex Co) ‘Ba 
2 


Moreover, if the process is at phase minimum and reversible on the left 
[krols -A + LrkC) Bol, — 0 and the robustness of LO 6 [AND 89], [SAF 80] 


is recovered for regulator Hy — LOG at the beginning of the process (at point ©). 


NOTE 13.2.— the demonstration of this result, omitted for lack of space, uses the 
separation principle presented in section 13.2.2. 


We obtain by duality the following proposition. 
Proposition (dual LTR) 


Cı =Cz, Di2 —0 = Kro minimizes ls -4+ 8x0) "Le: 





6 S,(s)=U tly, (s))7! satisfies the equality [bisu D| =1,if CD = 0. Note: 


A = 
Di” represents the reverse on the left of D12 : Dj" =(Di27 Dyn) | Di! . 
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If moreover the process is at phase minimum and reversible on the right 


lo — A+ B;,K zo) ‘Ere, — 0 and the robustness’ is recovered for regulator 


H, -LQG at the end of the process (at point ©). 


Hence, the dual LTR makes it possible to obtain good robustness margins with 
respect to uncertainties at the output of the system resulting in particular from 
sensors. 


Let us note that “input robustness” and “output robustness” are not necessarily 
antagonistic and that in the majority of the encountered practical cases, these 
properties converge. It is at least the bet of the standard state control presented in 
[LAR 00]. 


Cor - 
Process 


(sl A + Lex)" Bo 








(©) 










ULQG 





State feedback 


Figure 13.5. Equivalent LOG diagram 


13.2.5. Generalization of the H, standard problem 


The required (and commonly approved) hypotheses in the formulation of the 
standard H, problem are too restrictive to be able to rigorously solve the majority 
of control problems, at least by adopting the methodology recommended in this 
chapter. If the hypothesis “ D,, and D>; of full rank” can be made less strict by 
preferring a resolution of the problem based on the latest developments regarding 
the optimization by positive semi-definite programming’ [GAH 94, IWA 91], the 
internal stability of the relooped standard model F,(G,K) always appears as a 
constraint. As underlined in [CHE 93], this is restrictive in the context of the design 





7 That of the Kalman filter this time. 

8 The problem is formulated as an optimization problem under the constraint of Linked 
Matrix Inequalities (LMI). The numerical tools related to this type of optimization are from 
then on entirely competitive. 
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of a regulator because the constraint should only relate to the internal stability of the 
process and not of the standard model which potentially includes dynamic 
weightings. For this reason, we consider it useful to present a generalized version of 
the standard problem for its use in the context of the methodology of the control 
suggested further on. 








Figure 13.6. Toward defining the H, generalized problem 


The generalized H, problem can be formalized as follows. Let us consider the 


looped system in Figure 13.6 with we R”™!, we R”? , xe R”, ze RP!, VE Ree. 
A realization in the state space of the standard model G(s) can be directly deduced 


from those presumed minimal of W,(s), G? (s) and W, (s): 





0 0 0 Aw, Bw, Aw, Bw, 
Go(s):= Ci Di Dh W,(s):= W,(s):= 


No Cw, | Pw, Cw, | Pw, 
c9 | D9 o0 
U 
[ Ay, 0 0 By, 0 | 
B? Cw, A o0 B? Dw, BS 





0 0 0 0 
G(s): = By, Di1Cw, Bw Ci Aw, By, Di Dw, By Dh = 
0 0 0 0 Cy) | Di D 
Dw, PuCw, Pw. Cw, | D, DD, Pw, Diz oe 
0 0 0 
D31Cw, CG 0 D;1Dw, 0 














[13.19] 
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We will assume Di, = Dy, D? Dy, =0. By construction, the modes of W,(s) 
are unobservable by y, whereas the modes of W,(s) are non-controllable by u. If 
these modes are unstable, the standard model G(s) is non-stabilizable by u and non- 
detectable by y. The standard H, problem cannot be solved (we are outside its 
context of hypothesis). 


DEFINITION OF THE H, OPTIMIZATION PROBLEM GENERALIZED — it is 
a question of finding Kp, which ensures: 


— the inner stability of the looped process T, , =F, (G°,Ky,) ; 


— the minimality of the criterion Jy (Kp, ) = [Talh : 





LEMMA 13.1.— the existence of a solution to the problem above requires the 
hypotheses (H0) to (H3) below. 


HO. The poles W,(s) and W,(s) are of non-negative real part. If this were not the 
case, it wouid be enough to incorporate the stable parts of W,(s) and W,(s) into 
the model G° (s). 


| A 0 BS Bee i Ay, 0 
H1. The pairs Bice Ay, > By DY, and 21, B° en Ao | fare 


respectively stabilizable and detectable. If these hypotheses were not satisfied but if 
(c9 AO BD )is stabilizable and detectable, we will reduce beforehand the standard 
model (see [FRA 77]) so that it satisfies (H1). 


H2. Dip (respectively D>, ) is of full rank per column (resp. per row). 


H3. The realizations of Gy(s) and Gas), obtained from equation [13.19], have 
no other invariant zeros on the imaginary axis that belong respectively to the 
spectra of Aw and Ay . Precisely: 


AP -joa 0 B 
0 0 
H3.1 | Bw,Ci Aw, Bw Dn | is of full rank per columns VOE R 
0 0 
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Ay = jol 0 By 


0 0 oe 
H3.2| Bi Cy, A By Dy, | is of full rank per rows VOE R 
0 0 no 
Dy Cy, C2 PaP, 


H4. Di =0. 


THEOREM 13.1 (SOLUTION TO THE GENERALIZED H, PROBLEM ).- under 
the hypotheses (H0) to (H3) of lemma 13.1, we can show that the generalized H, 
problem admits a solution if and only if: 


SP>0/ A’ P+ PA-(PB> +C," D Di! Dip) 1 (Bo! P + Di! C1) + Ci C, = 0 
3£20/ AS+3AT -(5C,7 + B,Dy,! D! D21) (CoE + Do,B,' )+ B By! =0 


Hence K=-(D' D ÿ (BlP+D1C) 
12 12 2 121 


and L=-ŒC +B’ D )(D pD y'i. 
2 1 21 21 21 
It is shown that: 
— the only unstable modes A+ ByK are the unstable modes of Ay. ; 
— the only unstable modes A+ LC; are the unstable modes of Ay ; 


— the optimal regulator has the same size as the standard model and is given by: 


A+ B2K + LC) | L 
Ky, (9) -( = z) 


Note that the separation principle continues to apply. 


We can also show the following original result which establishes the link with 
the well-known Regulation Problem with Internal Stability (RPIS) introduced by 
Wonham [WON 85]. 
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THEOREM 13.2 (HIDDEN EQUATIONS).— under the same hypotheses as 
previously, properties 1 and 2 are equivalent as well as properties 3 and 4. 


SP >0/A! P+ PA-(PB, + Ci Diy XDL D) (BE P+ DLC,)+CTC, =0 


Ti Aw, — 0 Ta- 


A} 0 BS Bee 
À o [Kat = 0 
I(T, Ka) / Byc) Ay By, Dip 


0 0 0 
(Dr Ci C, ) Ta + Dw, Di2Ka- Dw DiiCw, =0 


J£ >0/ AL+EA? -(5CT + BDA (D4 D21) (CE + Dy Bj )+ BBE =0 


Ay 0 
Aw Sa a + C?)+(0 By C1)=0 





(Sas La) / 


Lu 


By PCy, 
By, 0 0 
BÌD N E A =0 
Bi D 


Furthermore: 


T E 
P= Ta PTa Ta Ps 
PTa Ps 


where solution P, of the Riccati equation reduced to the controllable part by u is 


solution of 1. 


>5=| = Sy 
SaZ1 Sa¥iSa 


where solution X; of the Riccati equation reduced to the observable part by y is 
solution of 3. 


Let us give the idea of the equivalence proof of properties 1. and 2., the 
equivalence of properties 3. and 4. resulting by duality. 
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We show that 2.=>1. by partitioning the solution of the Riccati equation 


A P 


according to P = 
& | PT P 


} with P, matrix of the same size as Ay and then by 


T BT; T,” B 
PTa P 


Riccati equation reduced to the controllable part by u and T, solution of 2. 


verifying that r-| | is solution if we choose solution P; of the 


Reciprocally, we can deduce that 1.= 2. as follows. Equation I can be “seen” 
as the Lyapunov equation associated with the observability grammian by z of the 
looped system if u=Kx, with K defined in Theorem 13.1. The existence of a 
solution P>0 leads, according to lemma 3.19 of [ZHO 96], to the conclusion that 
(A+ BK) is stable even since the looped system is detectable by z. The non-stability 
of the pair (A, B) leads to the conclusion that the looped system must necessarily be 
undetectable by z and, consequently, that equation 2. admits one solution. O 


NOTE 13.3.— Theorem 13.2 generalizes the former reflections [LAR 93], [LAR 00] 
in the case of output frequency weightings. It introduces the dual problem of the 
regulator [DAV 76, FRA 77, WON 85]. Speaking of hidden problems would be 
more general. The problem of the regulator consists in fact of hiding, by a proper 
feedback, the non-stabilizable modes by u (interpreted as disturbances) in order to 
make them unobservable by z. The dual problem seeks to hide the non-detectable 
modes by y so as to make them non-controllable by w. It is clear that the existence of 
a solution for the H, problem is subordinated to the existence of a solution for each 
one of these sub-problems. 


When they exist, the solutions to the hidden equations are not necessarily single. 
Equation 2 of Theorem 13.2 is a necessary and sufficient condition to the Regulation 
Problem with Internal Stability (RPIS) which is well-known in other works [WON 
85]. The uniqueness of (T,,K,) is acquired as soon as GP (s) is reversible on the 
left and does not have zeros among the eigenvalues of Ay [STO 00]. In a dual 
way, the solution (S,,L,) in (4) will be unique if GS 1 (5) is reversible on the right 
and does not have zeros among the eigenvalues of Ay. - 


13.2.6. Generalized H, problem and robust RPIS 


Let us consider here the case of a standard model that does not have unstable 
modes unobservable by y. This restrictive and simplifying hypothesis will not block 
the “State Standard Control” type methodological developments. From the H3 
generalized problem, we can establish the following result which shows the presence 
of an internal model [WON 85] within the regulator. 
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Theorem 13.3 (H) REGULATOR AND INTERNAL MODEL). let us suppose 
satisfied the hypotheses of the generalized H, problem (degenerated hypotheses if 
the standard model does not have unstable modes unobservable by y). Let us 
suppose moreover that there is a solution to equation 2 of Theorem 13.2 (section 
13.2.5). Then, the H, optimal regulator (see Theorem 13.1) contains a copy of 
unobservable dynamics of the pair (CT, + D21, Aw ). 


The demonstration results from corollary 3.3 of [STO 00]. Oo 


COROLLARY 13.1.— the duplicate within the regulator of unobservable dynamics 
of the pair (CT, + D21,Aw,) is basically Wonham internal model. The A; 
regulator thus obtained satisfies the principle of the internal model. 


In what follows, we will seek to specify the conditions in which one will have a 
robust H, regulator where property lim (t) = 0 (condition of existence of IZ lb ) is 
10 


verified despite the arbitrarily small uncertainties on G°(s). Because if we know 
(see [HAU 83]) that the stabilizing and detectability properties are preserved for 
small disturbances on G(s) (or the state matrices which characterize it), it is not the 
same for the hidden properties. 


Besides the hypotheses of the H, problem, we will suppose that the W,(s) 
modes are perfectly known and that there is a matrix M, of appropriate size such 
as C} = M,C). In addition, the result statement will be facilitated by the introduction 
of the following notation: 


O Vv 0 0 0 
ww. c 1 0 0 
q' S q'xq' 
4A(o,v,,v,)~ 0 v O VW 0 JER 
0 0 wi © 
0 0 0 


THEOREM 13.4 (ROBUST RPIS).- let m(t)=(Po +pit +: +pqt1)e™ be a 


q 
mode of the exosystem W,(s). Hence, A, is similar to matrix (a o) with 
; D 





g'=q,0=2v=1v =0 if AER and q'=2q, c=re(A),vy, =im(A), 
v =-im(A) if AE C. The H, optimal regulator contains a strong internal model 
[WON 85] associated with the mode m(t) and consequently the rejection of the 
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mode m(t) on z(t) will be robust if Ay, is similar to Ay. -[' Bah zh In 
0 * 


other words, the mode m(t) must be observable p, times by y if yt) € R”. 


If the presentation of this result is original and in particular the relation with the 
H problem generalized, its demonstration can result from the traditional results on 


the regulator problem (see [ABE 00, HAU 83] and the references included). oO 


The result of Theorem 13.4 is important as it provides a key for the weighting 
choice W,(s) when we wish to reject a robust disturbance. It reiterates the “principle 
of sufficient duplication” introduced in [LAR 00]. 


13.2.7. Discretization of the H, problem 


Let us consider that we must implement on the computer a regulator H, 
designed beforehand in continuous-time [GEV 93, WIL 91]. A slightly clumsy way 
would be to approximate a posteriori the continuous-time regulator. We recommend 
the following way which proceeds by discretization of the H, problem and direct 
calculation of the optimal discrete regulator. 


Hence, the problem consists of determining the discrete-time regulator Kgy, (2) 
which will give to the numerical control? in Figure 13.7 a behavior close to that of 
the analogical control resulted from feedback K H, (s). Therefore, let us try to define 
the H standard discrete problem for which Kgy, (z) would be the solution. 





9 Bo and Er represent respectively the 0 order blocker and the sampling operator in 
accordance with Chapter 3. 
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Figure 13.7. Discretization of the standard H problem 

















Discretizing problem H, does not imply discretizing the standard model by 
writing Gg = Er ° Go By. By doing this, the behavior of the system in between two 
samplings would be neglected. Similarly, w would implicitly receive spectral 
properties that it does not have (w(t) cannot be assumed as constant between two 
sampling instances). The following lemma makes it possible to obtain Gg from 
hypotheses 1 to 3 below: 





2) the sampler Ey incorporates an ideal spectrum anti-aliasing filter (band-pass 


_Je fe 
2° 9 


filter of unitary gain on l | if fe is the sampling frequency). At this level 


we could also introduce a delay so as to take into account the delay of calculation. 
We will not do it here in order to avoid complicating the writing of the discrete 
model; 


3) the input of control u(-) remains constant between two sampling instances. 





10 This hypothesis is not at all compulsory. It just simplifies the writing of the resulting 
discrete model. 
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LEMMA 13.2 (OBTAINING THE STANDARD DISCRETE MODEL). under the 
preceding hypotheses, the discrete standard model Gg results from the continuous 
model by: 


Aq | Bu Ba 
Gio:=|Cnl 0 Dap [13.19] 
Ca2 | Pan 0 





with: 


T; 
By = [Ga (T, We 7 i.e. BaBa" = f e^TB BT e47 dT 
0 


[Ca Datz|=[Gao (Te y , Le. 


ea TE 0 IRC ale? Fa 


Dai 


D 
C2 = C2 and D421 = 21 


ve 
The model thus discretized makes it possible to write Theorem 13.5. 


THEOREM 13.5 (DISCRETIZATION OF H, PROBLEM).— Kay, (z) minimizing 


|F; (Gap, Kan, ) 


ZaWa 


LEA 











5 =| à also minimizes ll = | F (Gy, -Bo °Kan, oEr)|, . 


We once more neglect to give the proof of the theorem. At least let us give the 
idea. It is shown initially that the state xg, of the model Gj, has the same 


dynamic and stochastic properties as the discretized state x(kT,) of Gy, - It is 
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shown then that bal, Eza Zax = [db Finally, yq, is obtained by 
k 


discretization of the white noise D2\w. Oo 


NOTE 13.4— we have, by definition of the H, standard in discrete time, 


z À 
1 +7, | | 
Taw l= ee Í Graph Tå w (OT, (ee) do| . Standard [fe], 


7T, 
T, 


e 








cannot be interpreted this way. T„„ transfer uses continuous and discrete-time 


signals at the same time and ||T. is defined only through its interpretation in 
8 z y 8 TP 


wll 
terms of induced standard. 


NOTE 13.5.— the way the H discrete problem is solved is completely similar to the 
way the H, problem in continuous-time is solved. Only the formal expressions of 
Lyapunov and Riccati equations change [ZHO 96]. Alternatively, we can use the 





operator Ôr —Yr (see section 3.3) and the complex variable w = ees . In this 
e k = 
e 
case, the A, standard of the G(w) transfer is defined by 
1 
1 Ds Graph ie A 2 eT. 
IG, = — | Trace (G (w)G(w)) dw with w =—— and we find 
27 Te ele +1 


the same results as for continuous time. 


13.3. Data of a feedback control problem 


The goal of a methodological guide is to offer support to the designer of a 
control law throughout the design chain. Before being able to formalize the control 
problem through an H, optimization problem, since it is the part included here, it is 
important to proceed in a systematic way. Which are the “contours” of the system to 
be regulated? What do we know of its environment? Which are the means of action? 
Which is the available information in real-time? Which are the control objectives, at 
least from a qualitative point of view? These questions and the formalization of their 
answer belong to the methodological approach. We will use as much as possible the 
notations adopted in [LAR 00]. 
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13.3.1. Model of the process 


The first stage consists, without any doubt, of proceeding on the basis of 
functional reasoning. How should we define fhe system to be adjusted and which 
function must it have? From the nature of its function, we will infer the magnitudes 
that need to be controlled, denoted as y,, and at least qualitatively the control 
objectives. We will define on this occasion the setting or reference magnitudes r. 
Secondly, it is a question of defining the means of action necessary or useful for the 
achievement of the control objectives. The action magnitudes are reunited into a 
control vector noted by u. Any other means of action that is not used for the 
feedback control considered will be labeled as disturbance. The other sources of 
disturbances result from the environment of the system to adjust. The disturbance 
inputs are gathered in vector d. It is finally necessary to keep track of the 
measurements or observations which are likely to be used to carry out the control 
law. y, defines the vector of the noticed outputs. 


The contour of the system to be adjusted is “drawn” at the end of these first two 
stages through the data of its inputs-outputs d,u,y.,y,. It is then important to 


model the relations of cause to effect between these various magnitudes, starting 
from the equations of physics which govern its behavior, or directly by minimizing 
the distance between the inputs-outputs of the system and that of a mathematical 
model. We make the assumption here that this model is linear and defined by 





A | Ba By 
Gsar (s) =| Cy, | Dya Dir according to the diagram in Figure 13.8. 
Cy, | Dya 0 








Figure 13.8. Model of the system to be adjusted 


The model can then be simulated for various input signals in order to well 
understand the behavior of the system and the evolution of its internal variables 
Xs4r- It is important from now on to specify the operation environment of the 
system. 


428 Analysis and Control of Linear Systems 


13.3.2. Modeling the environment of the system to adjust 


Controlling any system consists of controlling its actuators so that it can fulfill 
the required function. The control law naturally depends on the function to be 
accomplished and on the conditions under which it must be accomplished. The 
magnitudes to be controlled were already defined in section 13.3.1. It is still 
necessary to specify the requirements imposed on them. Similarly, macroscopic 
information on the type of disturbances to which the system is subjected can be 
validly used during the design of the control. 


Let us specify these aspects by using the example of an electric machine. The 
same machine can fulfill very different functions. Does it have to function with an 
engine or a generator? We implicitly answer this question while defining which are 
the control magnitudes and the magnitudes to control. Typically, position or speed 
control pertains to an “engine” operation. Let us suppose that we are interested in a 
position control. The formulation of an efficient control law requires more 
information on the operation conditions of the engine. Which position profile does it 
have to follow? Is the position reference likely to vary in an abrupt way, with stages, 
or on the contrary its evolution is linear, as it can be the case sometimes in robotics? 
It is the same with specific applications where the reference is quasi-periodic, even 
sinusoidal [DET 99]. In addition, does the disturbance, which is here a resistive 
torque, have a characteristic “signature”? 


A regulator, be it a little sophisticated, will use this information to predict and 
anticipate the future evolution of the system. It is clear that the quality of this 
anticipation depends on the capacity of the regulator to predict the evolution of 
references and disturbances on the basis of their past evolution. For this reason, the 
development of a control law necessarily supposes (sometimes in an implicit way) 
the definition of predictor models which specify and formalize the environment of 
the system to be adjusted. In what follows, é will indifferently define the 
disturbance d , the reference r, or the aggregation of both. Generally, the signal & 
will be described by a Markov model as follows: 


HE oC 


Let us define w(t) = D GS-1) with ô as a Dirac distribution, (g;)j<y and 
i 


(t;)ien as two independent random sequences with value in R. (f;);en is strictly 


increasing. 
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This choice enables the adequate description of many standard signals. Consider 
the following examples: 


E - K a Ea e 6(t) = Lane t;) 


enables the description of a Wiener noise. 


0 1 0 
t — p Dot) 
Fee 0 -a | a ofa D» 1—e %% Raa 
c(t) 1 0 0 w(t) Xe (t) : aye Oot“) 


describes the evolution of the standard signal represented in Figure 13.9. For 
@, =0, & evolves in ramps piece by piece. 


‘ 0 ll 0 
Fa) -a 0 l0 1 PIG 
ŠA) w(t) 


1 
E(t) = È| a! cosol -1)+ q? sinal- 1) | Tr@-t;), si a$] 


di 





1 


describes the evolution of a harmonic signal. 








Figure 13.9. Model of a 2" order aperiodic signal 
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Under certain hypotheses on the random variables q, and f., the signal E(r) has 
second order statistical properties equivalent to the signal E'(r) obtained by 
injecting at the input of the model a centered white noise w'(t) [BRO 92, LAR 93]. 
This second form is besides better adapted to take into account a disturbance on the 
signal measured which is often connected to a colored noise. 


Any input signal w or w' is irreducible in both cases and the best prediction of 
Xe (t + At) that can be done at instant ¢ is given by Xe (t + At) = ns” (t). This 
point is illustrated in Figure 13.10. xz(r), being internal to the signal model É, 


incorporates everything that we know on the past evolution of é. 


Predictor model 


dE (2 \2, 0 | 
SD, Poso 







s 








Figure 13.10. Sinusoidal prediction 


Finally, the designer task consists of providing the type of the prediction to be 
carried out (constant, in ramp, sinusoidal prediction). This information is formalized 
through the data of predictor models of disturbances and settings: 


“(2 7G) = GENE) wx 


13.3.3. Additional data 


The designer may specify certain statistical information that he knows such as 
the average frequency of a disturbance appearance, the characteristics of a 
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measurement noise, etc. A H, criterion may be inferred in its “LQG” interpretation 
(see section 13.2.3). This approach is not viable for the design of a control law. Even 
if we exclude the fact that obtaining this information is a problem in itself, 
minimizing the hope for the output signal of the standard model would not guarantee 
in any way the dynamic performances expected. The interpretation of the H, 
standard as “energy” of the impulse response is better adapted here. 


Matrices O,R,N..V,W,N f that intervene in the criterion H —- LOG must be 
interpreted as weighting matrices. They cannot be left as such in the hands of the 
designer as he may give up in front of the difficulty of the task or he may settle for 
an average adjustment. Better than a few empirical rules, the question of choosing 
the weightings deserves a true “methodological hat”, which makes it possible to 
obtain the weightings of a limited number of adjustment parameters with well 
understood effects. The designer will be able then “to adjust” these control 
parameters to best manage the control compromises. The definition of such “a 
methodological hat” is the subject of the following section. 


13.4. Standard H, optimization problem 


The relation between an H optimization problem and the synthesis problem of 
a regulator based on the information collected in the preceding section seems still 
fine at this stage. We will describe the various construction stages of the standard 
model for the H, optimization, based on the information available in section 13.3 
on the one hand and on the calculation of the weight matrices from the parameters 
left at the designer’s free choice on the other hand. 


13.4.1. Construction of the conceptual control model 


The magnitudes penalized in the H criterion must be selected in a coherent 
way to lead to a clear optimization problem. This is what is carried out by the 
following stages and is summed up by the diagram in Figure 13.11. 


Once the signals d,u,y.,y, are defined as indicated in section 13.3, we 
establish the process model Gs4r (s) defined by: 


Xsan) | Asar | Ba By (sar 
Se |= C D,a Dyu d 


Ve 
Yo C D 0 u 


Ye yd 
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We associate an additional input which will enable the additional incorporation 
of a noise on the evolution equation while defining Gs4rw (s) by: 





u 


G X 
tsar) |Asar | Ba [0 I] B aun 





d 
Ye |=| Cy, [Da 0 Dyal [13.22] 

X 

Yo Cl Die 50 0 $ 





We define the predictor model of disturbances and settings with which we 
associate an evolution noise whose intensity will be fixed later on according to the 


: Az |B 
ie cee a 
control objectives: | y |= Cz, . We define the setting variance 
0 | (wy, 
d Cg, 


e=r-—y, which will have to be brought back to 0. 


We seek the one trajectory of the process (-)(u,(-),x,()) which ensures the 
nullity of the setting variance and thus y, =r. It results from the state of the 
predictor model of disturbances and settings according to relations x, =7,xz and 
ua = Kaxg, in which T, and K, are solutions of Sylvester equation: 


as — AsarTa — B, Kg + ByCz, = 0 
Cy Ta + Cz =0 


To conclude, we define the control variance e,=u-u, and vector 


F 
y= | J+ w, ofthe magnitudes accessible to the regulator. 
Ye 
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Figure 13.11. Conceptual control model (CCM) 


We easily obtain a minimal realization of the CCM from what precedes. We 
note: 


e W. 


y 
pe . Gmcc, | 
= w 


Gucc, Gucc,, 


= 


[13.22] 


CHERS 





Gucc(s) 
We are able from now on to associate with the control problem a clear and 


relevant H, optimization problem. This will be done in the following section. 


13.4.2. Definition of the H, optimization problem 


y7 1 I 2.1 
On the basis of CCM and weighting matrices Q, = 0,12 0/2, Re = RJ RJ 


T T 
and 0, = 0,/20,/2 R, = RAR /⁄ , we build the standard model Gy, (s) 


defining the H, problem (see Figure 13.12): 
1 1 
RP 0 0 RP 0 0 
ai 2 2 
Gy, (s)= 0 c 0 Gucc\s) 0 O O}= 
0 0 l 0 0 Tf 
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Figure 13.12. Definition of the H, problem 


The H, problem that consists of calculating K(s) ensuring the internal stability 
of Goapr(s) and minimizing lz; (Gp, KO, is a generalized clear H, problem 


(section 13.2.5) considering the previously taken precautions (penalization of 
variances e, and e with respect to the asymptotic trajectory). It remains to be 


entirely defined. Choosing the weighting matrices is potentially choosing 


X Tuce +1) 


AMCC coefficients if nycc is the order of the CCM. It is too much to 


be able to properly handle them. Other works unanimously refers to Bryson’s rule 
[BRY 69] which suggests using the diagonal matrices in order to simplify the 
problem. Hence, each coefficient is applied to a state variable or particular control 
and the “rule” stipulates choosing it in accordance with the variation range (existing 
or desired) of the variable considered. It is implicitly supposed that each variable has 
an obvious physical direction. 


This standardization\\ approach is good but still it is too approximate and often 
delicate. De Larminat in [LAR 93] recommends standardizing the system according 
to the controllability or observability on a determined horizon, based on the partial 
grammians (see section 13.1). This approach seems to be the proper one (we will 
justify this) and is what we will present here. 


Matrices Q. and R, will be inferred from the choice of a control horizon T, : 


c 


T, 
Ro =T, fo (Cy, e^ B,) (C, e^“ B, at, Q. =I [13.23] 





11 The interest was always in standardizing the units of descriptive physical magnitudes of 
the process considered by expressing them in percentage of the nominal value. 
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Symmetrically, matrices Q, and R, will be inferred from the choice of an 
observation horizon T, : 


T, Ato To „Atg 
Q, =T, [ec Ge" atl R, =I [13.24] 
Let us note various possible rewritings for matrices R, and Q, : 


T. 
Re = Tefo (Cie By)" (Cie By) dt = TB» Go (T.)Bo 


T, AT a = 
Q, =I, Í 5 e^ O Ced =1T Goo 


Understanding these choices implies returning to the various interpretations of 
the grammians given in section 13.2.1. We leave this exercise to the reader who can 
refer to [LAR 00] for more details. We will mostly deal with the consequences of 
such a choice. 


13.4.3. The interest in standardization 


The previous choices are of undeniable methodological interest. From the 
standardization approach it results that the horizons T, and T, respectively adjust 


the dynamics of the control modes and of filtering modes (see separation principle). 
It is noted in [LAR 00] that the control poles, for example, are in general at the left 


. 1 ; : 
of the real axis ———. Our experiment corroborates this result. A second result, 
C 


equally significant, is stated in the following proposition. 


Proposition: “standard” character of SSC 


The properties of the feedback control system obtained according to the standard 
state control (SSC) procedure are independent of the state representation of the 
model of the system to be adjusted and of that of the predictor models. 


Let us provide the demonstration idea of this original result. It results directly 
from the two following notes. 
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The corrector K opt = arg lA (Gy, (s), K(s)) | is independent of the 
minK(s) stabilizing 2 


state representation chosen for Gy, (s). 


Let us consider the models Gi (s) and Ge (s) obtained from two distinct 
2 2 
creations of Gycc(s) in the state space. Let T be the passage matrix from one 


creation to another. In order to have Gi (s)=Gz, (s), it is sufficient that: 
2 2 


Qa =T TQOuT !, Ro=Ras Oo =TOTT, Roz = Rp. These equalities are 
automatically satisfied by the weighting matrices calculated as indicated above from 
grammians. 


Thus, SSC has the essential property for a control methodology to lead to a result 
which is independent of the choice of internal variables of the process and 
exogenous signals. 


13.4.4. Management of the control compromises 


The control parameters left to the user’s free choice are two: T, and T,. How 
should they be used? 7, controls the dynamics of the observer. The higher it is, the 
slower the rebuild dynamics of the observer and the less its sensitivity to the 
measurement noises becomes. Generally, T, , selected as higher than T, ,will fix the 
dominant modes of control. The control horizon will be sufficiently small to 
guarantee good robustness margins in terms of module margin. T, — 0 involves 
indeed R^ — 0. Consequently, based on Figures 13.11 and 13.12, it is obvious 
that the state representation of Gy, (s) (see section 13.4.2) is such that Dj,» — 0. 
The conditions of the dual LTR defined in section 13.2.4 apply in theory only if the 
condition C; =C, is satisfied. This is not necessarily true if the SSC rules are 
applied. [LAR 00]. The “LTR effect” is, however, often noted!2. To be systematic, 
in H, criterion we will have to penalize vector z=(e, e ey" with 
e, = Vo Ya if Ya is defined by y, = Cyxq ((Ug,Xq) is the asymptotic trajectory 
defined in section 13.4.1). Hence, if the process is at phase minimum in the 
bandwidth of the control, we have the guarantee of obtaining “good” robustness 
properties (module margin) at the level of sensors (system output). 





12 Minimizing [Cist A+ BKi9)'Lre |, does not necessary lead to make 


les- AB) Lexl, small unless Im(C))e Im(C;). 
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Finally, the compromises related to the adjustment of the process are managed 
from these two parameters. Nothing prevents after all to define a horizon T., thus 
making it possible to adjust the dynamics of the pursuit. This can be carried out in a 
very systematic way without questioning the structure state feedback/observer, 
which results from the separation principle and which is advisable to be used during 
the implementation of the regulator on the computer. In addition, the discrete-time 
regulator is easily obtained by applying the result given in section 13.2.7. 


Finally, if we seek to obtain a “robust” setting follow-up, during the construction 
of the CCM (see section 13.4.1), we could to add on y, a vector of disturbances d, 
of the same size as y, and such that: 


| Xq 
ne Ip @A; I 0 a 
d, Co 0 Ip ee 


(Cao: 12 ® A£) observable 


According to Theorem 13.4, this addition will guarantee obtaining a robust 
setting follow-up. 


13.5. Conclusion 


This chapter does not describe in detail SSC but it only outlines its main 
principles. General results, which may be original for some of us, were presented so 
as to lay down the theoretical bases of a control methodology using the state 
approach. Hence it should be noted the atypical presentation of H, -LTR , the 
definition and the resolution of the generalized H, problem, the relation between its 
resolution and the so-called “regulator” problem, the discretization of the standard 
H, problem, etc. The generality of these developments makes it possible to 
consider various extensions to SSC. 


If the methodological contribution of SSC is undeniable in our eyes, it is possible 
to still enrich it in various ways. It may thus be interesting in certain cases to be able 
to obtain good robustness margins at the beginning of the process rather than at its 
end and to still widen the class of systems which can be approached by SSC. It is 
also possible to introduce frequency weightings without the state approach (in the 
deep sense of the term) which is the basis of SSC. Finally, it is possible to 
specifically reduce the sensitivity of certain transfers of the looped system with 
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respect to uncertain parameters and thus to increase to a certain extent the robustness 
of the regulation. For that, we enrich the initial H, criterion of the parametric 
sensitivity of significant transfers. The optimization problem which results from this 
is in this case more complicated numerically as shown in [CHE 01]. A sufficiently 
efficient algorithm could be developed despite everything (see [YAG 011). It 
remains to polish the methodological aspect that makes these tools usable by the 
designer. 


13.6. Appendices 
13.6.1. Resolution of the Lyapunov equations 


Preliminary definitions 


Let be M and N be two matrices of size mxn and pxq respectively. 


mı iN DES Mp N 
M@N=| : th : |. ® defines the Kronecker product. 
MN qu Mma N 
T 
Vec(M) = (m; o Mm M2 CT Mm2 U Mn Y Mn) 


PROPOSITION 13.1.— the equivalence below is verified. 
P=F'P+PF+Q © Vec(P)=-[1@A! +A! QI] Vec(P)—Vec(Q) 


This equivalence can be used with gain to calculate P(t) starting from the data 
of matrices F and Q. We obtain the partial grammian of controllability for F = A7 
and Q = BBT and the partial grammian of observability for F = A and Q = C rE 


PROPOSITION 13.2.— equation F TP+ PF +Q =0 with the unknown factor P 
and for which Q is symmetrical, has the following properties: 


— F does not have eigenvalues on the imaginary axis and is a sufficient condition 
for the existence of a solution; 


— let us suppose that: 3C / Q = C TC and (C, A) is detectable, then [P 2 0> F 
is stable]; 
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+00 
T 
-F is stable= P = fe TOe dr. 
0 


13.6.2. Duality principle 


a ic A|B 
= defines the dual system of G(s):= cip. It 


System G(s)! — 
B” | D’ 
appears trivial that a system and its dual have the same modes. We also have 


without difficulty, the following relations of duality: 


(A,B) controllable <= (B',A') observable 


(C,A) observable & Aer) controllable 


Let us also consider the diagram of standard feedback in Figure 13.1 pointed out 
below. We verify without difficulty the following equalities: 


Ty * [GX] [RG] * 7, 


Consequently, “finding K which minimizes 
are two dual problems and we can infer the solution of 











| Zalh = Ty 


2 Or “finding L which 








minimizes | 
one from the other by the relation of duality L = K 
W Z W 
= LE 


G 
——b- | 














T+ rss 


Figure 13.13. Standard feedback diagram 
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13.6.3. Another useful interpretation of grammians 


Grammians are more than mere intermediaries for calculation of the A; 
standard. In robust control, they are used for model analysis and reduction. The 
controllability grammian contains useful information pertaining to the areas or 
directions of the state space which are the most “excited” by the inputs. The 
observability grammian contains similar information pertaining to the state space 
areas where the outputs are the most “sensitive”. The construction of these areas 
(contours of an ellipsoid) can be obtained on the basis of a decomposition of 
grammians into singular value (semi-axes of the ellipsoid of R” deduced from the 
values and singular vectors of the grammian concerned). Precisely, the 
controllability grammian G, makes it possible to define the area of the state space 
reachable by inputs of given energy, whereas the observability grammian G, 
defines the sub-space of initial conditions x) producing outputs of given energy. This 
interesting geometrical interpretation of grammians was provided by Moore [MOO 
81]. 


13.6.4. Zeros of a multivariable system 


A|B 
Let te) be a minimal realization of the transfer matrix G(s) tom inputs 


and p outputs. Let us assume that zọ € C is not a pole of G(s). In the case mono 
input-mono output (m= p=1), zo is a G(s) zero (finite) if and only if G(z))=0. 
In the multivariable case, the concept of zero is richer and its characterization more 


delicate. Before pointing out some results enabling this characterization, let us give a 
dynamic interpretation to the concept of zero. 


PROPOSITION 13.3— z, is a zero of transmission G(s) if there is an input vector 
of the form u(t)=uge and an initial state x(t9)=xg so that the output of the 
system is identically zero. 


THEOREM 13.6.— zo is a zero of transmission G(s) if and only if one of the two 
following propositions is verified: 


1) 4e C/rank(G(zo)) < rank(G(s)) 


A-zol B A-sl B 
2) Ase C/rank < rank 
C D C D 
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The zero of transmission zQ is called blocking zero if rank(G(z0)) =0 or, 


A-zol à 
=n 


lly, k 
equally, ran | 5 


If we tone down the minimal hypothesis of realizing G(s), the proposition 
enables the characterization of the invariant zeros which include, in addition to the 
transmission zeros, decoupling zeros (uncontrollable or/and unobservable modes of 
the realization) (see Chapter 4). 


13.6.5. Standardization of a system 


A|B 
Let us assume the system G(s):= lar Let us note by ue R”, xe R”, 


ye R” the input, state and output vectors of the system. Let N,, N, and N, be 


diagonal matrices of suitable size enabling the standardization of the inputs, states 
and outputs of the system: u,=N,u, xy =N,x, y,=N,y. The standardized 


model Gy (s) admits for realization in the state space: 
NAN | NBN, 

Gy (s) = = -| -| al 
N CN | N,"DN, 
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Chapter 14 


Multi-variable Modal Control 


14.1. Introduction 


The concept of eigenstructure placement was born in the 1970s with the works of 
Kimura [KIM 75] and Moore [MOO 76a]. Since then, the eigenstructure placement 
has undergone continuous development, in particular due to its potential applications 
in aeronautics. In fact, the control of couplings through these techniques makes them 
very appropriate for this type of application. Moore’s works led to numerous stud- 
ies on the decoupling eigenstructure placement. The principle consists of setting the 
dominant eigenvalues of the system while guaranteeing, through a proper choice of 
related closed loop eigenvectors, certain decoupling, non-reactivity, insensitivity, etc. 
Within the same orientation, Harvey [HAR 78] interprets the asymptotic LQ in terms 
of eigenstructure placement. Alongside this type of approach, Kimura’s works on pole 
placement through output feedback have been supported by several researchers. In 
these more theoretical approaches, the exact pole placement is generalized during the 
output feedback. The degrees of freedom of eigenvectors are no longer used in order 
to ensure decoupling — as in Moore’s approach — but in order to set supplementary 
eigenvalues. Recently, research in automatics has been particularly oriented towards 
robustness objectives (through methods such as the H synthesis, the u-synthesis, 
etc.), the control through eigenstructure placement being limited to the aim of ensur- 
ing the insensitivity of the eigenvalues placed (insensitivity to the first order) by a 
particular choice of eigenvectors [APK, 89, CHO 94, FAL 97, MUD 88]. It was only 
recently that the modal approach was adjusted to the control resisting to paramet- 
ric uncertainties. This adaptation, proposed in [LEG 98b, MAG 98], is based on the 
alternation between the ju-analysis and the multi-model modal synthesis (technique of 
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-Mu iteration) and makes it possible to ensure, with a minimum of conservatism, the 
robustness in front of parametric uncertainties (structured real uncertainties). 


In this chapter, we will describe only the traditional eigenstructure placement. We 
will see how to ensure certain input/output decoupling or how to minimize the sen- 
sitivity of the eigenvalues to parametric variations. These basic concepts will help 
whoever is interested in the robust approach [MAG 02b] to understand the problem 
while keeping in mind the philosophy of the standard eigenstructure placement. The 
implementation of the techniques previously mentioned is facilitated by the use of 
the tool box [MAG 02a] dedicated to the eigenstructure placement (single-model and 
multi-model case). 


The first part of this chapter will enable us to formulate a set of definitions and 
properties pertaining to the eigenstructure of a system: concept of mode and relations 
existing between the input, output and disturbance signals and the eigenvectors of the 
closed loop. We will see what type of constraints on the eigenvectors of the closed loop 
make the desired decouplings possible. Then we will describe how to characterize the 
modal behavior of a system with the help of two techniques: the modal simulation and 
the analysis of controllability. This information will allow to choose which eigenvalues 
to place by output feedback. This synthesis of the output feedback will be described 
in detail in the second part of this chapter. Finally, the last part is dedicated to the 
synthesis of observers and to the eigenstructure placement with observer. 


14.2. The eigenstructure 


In this section we will reiterate the results formulated in [MAG 90]. 


14.2.1. Notations 


14.2.1.1. System considered 
In this part, the multi-variable linear system considered has the following form: 
t = Ax + Bu 


[14.1] 
y = Cx + Du 





where = is the state vector, u the input vector and y the output vector. The sizes of the 
system will be as follows: 

















n states az € R” 
m inputs u € R” 
p outputs y € R? 


The equivalent transfer matrix is noted G(s): 


G(s) =C(sI — A) *B+D 
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14.2.1.2. Corrector 


In what follows, the system is corrected by an output static feedback and the inputs 
v (settings) are modeled with the help of a pre-control H. Therefore, the control law is: 


u=Ky+ Hv [14.2] 
where v has the role of reference input. 
If D=0: 
z=(A+ BKC)x+ BHv 


If D ¥ 0, the expressions of y in [14.1] and of u in [14.2] make the following 
relation possible: 


u=(1-KD) !KCæ+(1-KD) ‘Hv 





By substituting u in the relation & = Ax + Bu, we obtain: 
&=(A+B(1-KD) ‘KC)æ+B(I1-KD) ‘Hv 
By noticing that K(I — DK) ! = (I — DK) !K,we get: 


&=(A+BK(I- DK) *C)æ+B(I1-KD) ‘Hvw 


14.2.1.3. Eigenstructure! 


The eigenvalues of the state matrix of the looped system A + BK (I — DK)~'C 
are noted: 


Alas tear 
the right eigenvectors: 
Visens Un 
and the input directions: 
W1,..., Wn 
where (by definition): 
w=(I-KD)'KCu, & w= K(Ov; + Dui) [14.3] 


1. In this chapter, it is supposed that the eigenvalues are always distinct. 
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The left eigenvectors of matrix À + BK (I — DK)~'C are noted: 


Us: Un 
and the output directions: 
D shh 
where (by definition): 
t=ujBK(I-DK)"' © ti =(wB+tD)K [14.4] 


14.2.1.4. Matrix notations 


Let us take q vectors (generally q = p or q = n); the scalar notations À;, Vi, Wi, 
Ui, ti become: 


A 0 

A= =<, [14.5a] 
0 Aq 

WS fie. cee. v], WS a ... wq] [14.5b] 
ui ty 

Cel yi. Fil [14.5c] 
Ug tq 


If À; is not real, it is admitted that there is an index i’ for which À; = À;. Thus, 
in matrices V and W, vy = 0;, wy = Ù; and in matrices U and T, uj = ü;, ty = ti. 
In addition, when it is a question of placement, we will consider that if À; is placed, 
then Ay is placed too. Vectors u; and v; are standardized such that: 


UV =I and U(A+BK(I—DK)"!C)V=A [14.6] 


14.2.2. Relations among signals, modes and eigenvectors 


Apart from the definition of the concept of mode, the objective of this section is 
to study the relations between excitations, modes and outputs in terms of the eigen- 
structure. Knowing this makes it possible to consider the decoupling specifications as 
constraints on the right and left eigenvectors of the looped system (constraints that 
could be considered during the synthesis). This knowledge is the basis of the tradi- 
tional techniques of eigenstructure placement. However, in many cases, the decou- 
pling specifications are not primordial. In fact, it would be often be preferable to place 
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the eigenvectors of the closed loop by an orthogonal projection, this approach enabling 
us to better preserve the natural behavior of the system. 


In this section, for reasons of clarity, we will consider a strict eigensystem (with no 
direct transmission (D = 0)). The different vectors considered are: 


— the vector of regular outputs z; 
— the vector of reference inputs v; 


— the vector of disturbances d. These disturbances are distributed on the states 
and outputs of the system, respectively by Æ” and F”; 


— the vector of initial conditions xo. 


System [14.1] becomes: 


z= Ax + Bu + E'd 
y=Cz+F'd [14.7] 
z= Ex+ Fu 





14.2.2.1. Definition of modes 


Let us take the state basis change where U corresponds to the matrix of n left 
closed loop eigenvectors (see [14.5]): 


£=Ux [14.8] 


where: 

1 

gn 

The various components of this vector will be called the modes of the system. 


In [14.8] there was an obvious relation between state and mode of the system. 
Identically, the relations between excitations, modes and outputs of the system will be 
detailed, which will enable us to interpret the various specifications of decoupling in 
terms of constraints on the eigenstructure of the system. 


14.2.2.2. Relations between excitations and modes 


The input u of [14.7] is of the form [14.2]. The effect of the initial condition is 
modeled by a Dirac function 29d, hence: 


tz = (A+ BKC)x + BHv+(E' + BKF')d+ 296 
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Of: 
&=(A+BKC)z+f 


where f corresponds to all excitations acting on the system (f = BHv + (E' + 
BF'K)d + x99). After having applied the basis change (€ = Ux): 


€=AE+Uf 
We obtain: 
E(t) = e™ xU f(t) 
where “+” is the convolution integral and e^ the diagonal matrix: 
e^t = diag(eòtt,...,eò”t) 


In addition: 


t 
Eilt) = e* x uf (t) = | elu, f(r) dr [14.9] 
0 


14.2.2.3. Relations between modes and states 


By returning to the original basis, we obtain: 


w=VE=)S_ bu; [14.10] 


i=1 

This relation shows that the right eigenvectors of the system control the modes on 
the states. 
14.2.2.4. Relations between reference inputs and controlled outputs 


Here, f = BHv. Instead of considering the state vector as above, we consider the 
controlled output z = Ex + Fu. The term Ex can be written EV € and the term Fu: 





Fu = FKCgz + FHv = FKOVE + FHv = FWE + FHv 


The mode transmission becomes: 
— prit | = = Ui 
Elt) =e! xu BHv and z= D [E F] H &(t)+FHv [14.11] 
The transfers between v and £ and between the modes and z (by omitting the term 
that does not make the eigenvectors appear, FH v) can be written: 








2, [UBH] — [s1 -45| [E F] 4 a 
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We note: 
- Ey, Fp the k*t rows of E, F; 
— Zp, vp the kt! inputs of z, v ; 
— H, the kt? columns of H. 


The open loop relation between the inputs and the controlled output (by omitting 
the term that does not make the eigenvectors appear, F Hv) is given by (see [14.9] and 
[14.11] by considering W = 0): 


n t 
zp (t) => Env; J e^t- u; BH v(t) dr [14.12] 
i=1 0 


The conditions that the eigenvectors must satisfy so that there is decoupling are 
immediate: 


uBHyx =0 = vx does not have any effect on the mode £;(#) 


Egv; + Few; =0 = the mode €;(t) does not have any effect on zx 


14.2.2.5. Relations between initial conditions and controlled outputs 


The transfers between 00 and € and between the modes and z can be written: 














95 U 











+ 
+ 








(s1 -M75 [E F] ME 











Based on the notations previously mentioned, the equivalent constraints on the 
eigenstructure are: 


uizo =0 = the initial condition does not have any effect on the mode &;(t) 


Eņpvi + Fw; =0 = the mode &;(t) does not have any effect on zg 


14.2.2.6. Relations between disturbances and controlled outputs (F = 0 or F' = 0) 


The transfers between d and € and between the modes and z can be written: 




















F' W 





[u T] BA (s1-A) | lE FI wl 




















The equivalent constraints on the eigenstructure are: 


WE, + tii, =0 = d, does not have any effect on the mode €;(t) 





Egvi + Few; =0 = the mode €;(t) does not have any effect on zg 
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14.2.2.7. Summarization 


The analysis of the time behavior of a controlled system was done in the modal 
basis. Each mode is associated to an eigenvalue À; of the system in the form et. We 
have shown that: 


— the excitations act on the modes through the left eigenvectors U and the output 
directions T; 


— the modes are distributed on the controlled outputs through the right eigenvec- 
tors V and the input directions W: 








ae U,T VW 
excitations | —— | modes | —— | controlled outputs 


























We have also showed that the decoupling on the controlled outputs have the form: 
Ev; + F kWi = 0 

EXAMPLE 14.1. The graph in Figure 14.1 is used in order to illustrate the decoupling 

properties accessible through this method. The system considered here is of the 3*4 


order and has two inputs and three outputs. 


The relations linking the modes and the controlled outputs are: 


Z1 Ey Ey fi 
22| = |E2| c= | Eo} [vi v vs] |é 
23 E3 E3 §3 


Reference Modes Outputs 
signals E 


1 Zi 
V1 
62 Z2 
V2 
63 Z3 
Figure 14.1. Example of desired decoupling between inputs/modes and modes/outputs 


The decoupling constraints in Figure 14.1 are: 
— the first mode must not have any effect on zı and z3; 
— the third mode must not have any effect on z1; 


— the reference input vı must not have any effect on the third mode. 
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Hence, we obtain the following constraints: 


Ev = 0 
E3v; = 0 
Eva = 0 
u3BH, = 0 


The first two equations will be considered during the synthesis of the corrector as 
constraints on the output feedback (K), whereas the third constraint pertains to the 
pre-control (H). 


14.3. Modal analysis 
14.3.1. Introduction 


The modal synthesis consists of placing the eigenstructure of the closed loop sys- 
tem (see section 14.4). In order to achieve this, it is very important to know very well 
the modal behavior of the open loop system and the difficulties related to its modifi- 
cation. As for all synthesis methods, those that we will use in what follows are even 
more efficient if the designer has a good understanding of the system he is trying to 
control. The analysis described in this section will help him avoid in the future trying 
to impose unnatural constraints on the control law. 


More precisely, modal simulation makes it possible to generate an answer to the 
following questions: what is the influence of each mode on the input-output behavior 
of the system? Consequently, on which models is it necessary to act in order to modify 
a given output? By considering afterwards synthesis-oriented objectives, we will seek 
to have information on the difficulty of placing certain poles. This relative measure 
will be obtained by using a technique of controllability analysis. A more complete 
study on this type of analysis can be found in [LEG 98a]. 


14.3.2. Modal simulation 


This refers to the analysis of the modal behavior of a system. This type of tech- 
nique is used when we want to know the couplings between inputs, modes and outputs, 
overflows, etc. It makes it possible to evaluate the contribution of each mode on a given 
output. 


Let us consider a signal decomposed according to equation [14.12]. In this equa- 
tion, we decompose the controlled outputs z. The modal simulation can also be rel- 
ative to the measured outputs y; in this case, this analysis also makes it possible to 
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detect the dominant modes (good degree of controllability/observability, etc; see also 
section 14.3.3). For the outputs measured, we will have: 


t 
Y(t) = Cro f eu BHv(T) dr 
0 


t 
++ Ceon f eòr t-u, BHu(r) dr [14.13] 
0 


where y, corresponds to the kt? input of y. The modal simulation consists of simu- 
lating each component: 


t 
Cku; f e^ t-u; BHu(r) dr [14.14] 
0 


of the signal y(t) separately. This evaluation provides information on the contribu- 
tion of modes À; to the outputs. It also makes it possible to evaluate the nature — 
oscillating or damped — of this contribution. 


1:5 
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Figure 14.2. Example of modal simulation. 
On the left: contributions of each mode; 
on the right: overall contribution 


EXAMPLE 14.2. An example of modal simulation is given in Figure 14.2. This exam- 
ple of modal simulation is taken from Robust Modal Control Toolbox [MAG 02a]. The 
simulation is meant to illustrate the modal participation of the modes of the system to 
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a given output. A step function excitation is sent at input. On the left of Figure 14.2 
are traced the different components of the form [14.14] and on the right is traced the 
sum of these components. On this figure, we can notice that the mode —1 does not 
have any influence on the output considered, thus it will not be necessary to act on this 
mode in order to modify the behavior of this output. However, the modes in —0.1 +7 
and in —2 are very important and they will have to be considered during the synthesis. 
In addition, the modal simulation provides information on the type of contribution of 
these two modes (transient state and permanent state). The former is very oscillating 
whereas the latter is damped. This information visually (and thus obviously) illus- 
trates the fact that the modes are associated to very different eigenvalues. Based on 
this analysis, the designer has a precise idea of the modal behavior of the system and 
can decide which models to modify in order to influence the outputs to control. 





DEFINITION 14.1 (DOMINANT EIGENSTRUCTURE). We call a dominant eigenstruc- 
ture the set of pairs of eigenvalues and eigenvectors having a preponderant influence 
in terms of input-output transfer. The modal simulation makes it possible to determine 
the influence of each mode on the system’s outputs and hence to isolate the pairs of 
eigenvalues and eigenvectors with a preponderant influence. This technique could be 
used in order to determine, among the set of eigenvalues of the system, which ones to 
place by output feedback. This concept of dominant mode is even more important in 
the context of multi-model techniques discussed in [MAG 02b]. 


After dealing with the input-output modal contribution, we will now present the 
input-output controllability of each mode (corresponding to the difficulty of placing 
the modes of the system through an output feedback). 


14.3.3. Controllability 


The study of controllability is a subject that generated a lot of interest and many 
investigations were undertaken by researchers [HAM 89, LIM 93, MOO 81, SKE 
81]. After having sought to determine if a state was or was not controllable (Kalman, 
Popov-Belevich and Hautus’ traditional approaches (PBH), Grammian technique), the 
research has rapidly turned towards the study of the difficulty associated with con- 
trolling a state. That is the point of origin for the concept of controllability degree. 
Numerous researchers have explored this field by adapting the traditional concepts 
of controllability (PBH test, Grammian method, etc.). In the majority of cases, these 
techniques are based on a study pertaining to the open loop and are not relevant for 
our situation. For example, the Grammian measurement of an unstable pole is infinite 
(zero controllability) and does not reflect the fact that this pole can be controllable 
by output feedback. Through a continuity argument, the controllability measurement 
of a pole in terms of stability is erroneous due to the nature itself of this pole. This 
statement makes this type of method unusable in the context of our approaches. The 
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technique that we choose, in order to efficiently apply the methods of eigenstructure 
placement, is the technique of modal residuals analysis, which provides an instanta- 
neous criterion independent of the type of eigenvalues analyzed. Other possibilities 
are proposed in [LEG 98a]. 


Modal residuals 


The modal decomposition can be evaluated by considering the time responses at 
a given instant and for a given input. Generally, responses to an input impulse (high 
frequencies) or to a step function on the state (low frequencies) are considered. Let 
us take equation [14.13] where BH v(t) is replaced by Bô (impulse response) and 
where the measured outputs are considered; the following result is obtained. 


Behavior at high frequencies: impulse response at instant t = 0 
We have: 
yet = 0) = Crviui By +--+ + OCkUnun Bi 


The quantities Cyviu; By, i = 1,...,n are called residuals between input number 
l and output number k. The evaluation of residuals Cyv;u; B; makes it possible to find 
the controllability degree of mode 2. 


EXAMPLE 14.3. A relative controllability analysis through the graph of modal resid- 
uals is given in Figure 14.3. The impulse residuals of each mode are represented in 
this figure as a bar chart. 
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Figure 14.3. Example of analysis of input-output controllability 
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14.4. Traditional methods for eigenstructure placement 


Based on the definitions of input and output directions ([14.3] and [14.4]), the 
following lemmas can be easily demonstrated. 


LEMMA 14.1 ([MOO 76a]). Let us take À; E€ C and v; € C”. Vector v; is said to be 
placed as the right eigenvector associated to the eigenvalue A; if and only if there is a 
vector wi € C such that: 


[A-A B] H =0 [14.15] 
Wi 
Any proportional gain K that makes it possible to carry out this placement satisfies: 


Vectors w; correspond to the input directions defined by [14.3]. 


Demonstration. If [14.15] and [14.16] are verified: 
Av; + Bu; = Aivi 
and: 
wi = (I — KD)! K Cv; 
By combining these two equations, we obtain: 

(A+ BUI — KD)" KC)y, = div; 
which justifies the “if” part of the lemma. As for the part “only if”, let us consider the 
last equation written as follows: 

Ui = 
(I-—KD)1KCu;| — 
By defining w; = (I — KD) !KCv;, we have K (Cu; + Dw;) = wi, which 
concludes the demonstration of the lemma. 


[A-A B] 














By duality, we also have the following result. 


LEMMA 14.2. Let us take À; € C and už € C”. Vector u; is said to be placed as the 
left eigenvector associated to the eigenvalue A; if and only if there is a vector t} € CP 
such that: 


[ui ti] i T] =0 [14.17] 


Any proportional gain K that makes it possible to carry out this placement satis- 


fies: 
(u;B+t;D)K =t; [14.18] 


Vectors t; correspond to the output directions defined by [14.4]. 
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Parameterization of placeable eigenvectors 


The vectors satisfying [14.15] can be easily parameterized by a set of vectors 7; € 
R”. In fact, based on [14.15], the eigenvectors of the right solutions belong to the 
space defined by the columns of V(A;) € R"*™ which are obtained after resolving: 


[A-A] B] wel =0 [14.19] 


Therefore, for a column vector 7; € C™: 


Based on [14.17], the eigenvectors of the left solutions belong to the space defined 
by the rows of U(A;) € R?*” given by: 


[U(A:) TOs)] * a | =0 [14.20] 


Thus, for a row vector 7; € CP: 


ui = MU (Ai) 


14.4.1. Modal specifications 


For any type of control, one of the main objectives is to stabilize the system, if it 
is unstable, or to increase its degree of stability, if poorly damped oscillations appear 
during the transient states. Alongside this, we can try to improve the speed of the sys- 
tem without deteriorating its damping. These specifications are interpreted directly in 
terms of eigenvalue placement. As we saw in section 14.2, a system can be dissociated 
into modes. Each mode corresponds to a first order (real number eigenvalue) or to a 
second order (self-conjugated complex number eigenvalues). These modes have dif- 
ferent contributions evaluated due to the modal simulation presented in section 14.3.2, 
hence we will have the concept of dominant modes (see note 14.1). For these domi- 
nant modes, it is possible to formulate the following rules: for a desired response time 
Ta and a desired damping £a, the dominant closed loop eigenvalues must verify: 


Re(A) <0 for stability 
3 

|Re(A)| > — 
Ta 


[Re(A)| 
JA] 





> £a 
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These constraints define an area of the complex plane (Figure 14.4) where the 
eigenvalues must be placed. 











Figure 14.4. Area of the complex plane 
corresponding to the desired time performances 


Let us note that — since the control is done through power systems (closed loop con- 
trols) with limited bandwidths — a supplementary constraint is imposed by the closed 
loop modes which must be placed within the same bandwidth. Hence, it is recom- 
mended to close this field by imposing a bound superior to |Re(A)| (see Figure 14.5). 











Figure 14.5. Area of the complex plane corresponding to the 
desired performances and to the constraints on the bandwidth 
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14.4.2. Choice of eigenvectors of the closed loop 


The solution sub-space of [14.5] or [14.9] is of size? m. Hence, it is necessary to 
make an a priori choice of eigenvectors in this sub-space. Several strategies can be 
used in order to choose these closed loop eigenvectors (see below). 


14.4.2.1. Considering decouplings 


We seek here to reduce the size of V(;) to 1. The right eigenvectors will satisfy 
[14.15] and the conditions pertaining to decouplings (see section 14.2.2) are of the 
form Eov; + Fow; = 0. Consequently, vectors v; and w; are calculated by resolving: 


A— IT B Vi | 
| E Al P =0 [14.21] 


Since matrix B is of size m, it is possible to impose m — 1 decoupling constraints 
(number of rows of Eo and F5). 
14.4.2.2. Considering the insensitivity of eigenvalues 


The concept of insensitivity consists of quantifying the variation of the eigenvalues 
of a system subjected to parametric variations. This quantification is given by lemma 
14.3. 


LEMMA 14.3. Let us consider the system [14.1] corrected by the output static feed- 
back K. For a variation of the state closed loop matrix À = A + B(I — KD)~'KC, 
we have for first order: 


Ay; = u;AÂv, [14.22] 
In addition, if the variation AA of matrix A is due to variation AK of matrix K: 


and if variation AA is due to the respective variations AA, AB, AC, AD of 
A, B,C, D: 


2. It is easily shown that the size of this sub-space is equal to m if and only if A; is not a 
non-controllable eigenvalue. In case of non-controllability, the size is superior to m. Thus, the 
degree of freedom which is lost when an eigenvalue is not movable due to its non-controllability, 
is recovered at the level of eigenvector placement which offers more degrees of freedom. 
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Demonstration. By definition: 
(A+ AÂ) (vi + Avi) = (A; + AXG) (vi + Avi) 


When we multiply on the left by u; and when we simplify the equal terms while 
taking into consideration that u;v; = 1, we have: 


Let us take, by neglecting the second order terms: 
uAÂv; = Ar; 


which corresponds to equation [14.22]. When the variation of the state closed loop 
matrix is due to an output feedback variation, we can replace AA with: 


AA = B(I— KD)“!AKC + B(I— KD)“!AKD(I — KD)"!KC 
Hence, equation [14.22] becomes: 
Adj = u; B(I — KD)~'AK (Cv; + D(I — KD) +K Cvi) 
Based on the matrix identity (I — K D) ! = I + K(I — DK) !D and the 
definitions [14.3] of w; and [14.4] of t;, ie. w; = (I — KD) !KC:; and t; = 
u;BK (I — DK), equation [14.22] becomes: 


which corresponds to expression [14.23]. Let us consider now that the variations of 
the closed loop dynamics are due to the variations of state matrices AA, AB, AC, 
AD. We have: 


AA = AA+AB(I-— KD)“KC + B(I- KD)“1KAC 
+ BUI - KD) !KAD(I - KD)"'KC 


By using definitions [14.3] of w; and [14.4] of t; as before, we will immediately 
have: 











which proves expression [14.24]. 





Based on equation [14.22], the variation of the eigenvalue A; is increased as fol- 
lows: 


JAXI < AÂ: 
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In order to minimize the sensitivity of eigenvalues, we can thus minimize the cri- 
terion: 


J= $ luille 
i=l 


Let us consider all the eigenvalues and the associated eigenvectors as being real. 
Let J; = ||u,||||v:||. Based on the relation u,v; = 1, we obtain: 


1 
* cos(uj, vi) 
In addition3, (u;) = (v1,..., 05 1, Vir1,..., Un)”. Jj is thus the reverse of the 


sinus of the angle between v; and the space generated by the other eigenvectors. Min- 
imizing J thus implies maximizing the angle between the eigenvectors. 


Case of state feedback 


The objective is to calculate the state feedback Ke by placing the poles {A1,..., 
An} while minimizing the sensitivity criterion: 


J= X lello 
isl 


General methods of non-linear optimization (gradient, conjugated gradient, etc.) 
can be used in order to carry out the optimization of the criterion. In the case of state 
feedback, it is possible to use an entirely algebraic method [CHU 85, KAU 90, MOO 
76b]. It is based on the interpretation of insensitivity in terms of angles between the 
eigenvectors. 


14.4.2.3. Use of the orthogonal projection of eigenvectors 


In many applications, decouplings are not primordial. In this case, it is preferable 
to choose the closed loop eigenvectors as orthogonal projections of the open loop 
eigenvectors. 


DEFINITION 14.2. Let us consider that the open loop eigenvalue Aio is moved into À. 


Based on the notations defined by [14.19], the open loop eigenvector vio (associated 
with the eigenvalue Aio) is projected as follows: 


3. The notation () designates the sub-space generated. 
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The eigenvector and the input direction of the closed loop are thus chosen as being 
the orthogonal projections of the open loop eigenvector with the help of relations 
[14.26] and [14.27]: 


vi = VA) ni [14.26] 
wi = W (Ai) [14.27] 


Properties of the orthogonal projection 


The choice of closed loop eigenvectors by orthogonal projection of the open loop 
eigenvectors makes it possible to: 


— minimize the control leading to a desired pole placement (with a minimization 
of secondary effects such as the destabilization of non-placed poles); 


— maintain the parametric behavior of the open loop. In fact, dispersion of the 
open loop poles — when the system is subjected to disturbances — is often acceptable. 
By considering that this hypothesis is verified and by keeping in mind that the dis- 
persion of poles is closely related to the eigenvectors (see developments at first order 
[14.22]), it is natural to consider using the degrees of freedom related to the choice of 
the eigenvectors of the closed loop in order to maintain this good dispersion [MAG 
94a], as shown in Figure 14.6. Ideally, based on [14.22], we would have to choose 
the closed loop eigenvectors that are co-linear to those of the open loop, in order to 
have a A); identical in open loop and in closed loop, but these eigenvectors are con- 
strained to evolve within a space defined by equation [14.19]. Therefore, we suggest 
choosing them as being orthogonal projections of the open loop eigenvectors on the 
eigenspace solution of [14.19] in order to minimize the distance between the closed 
loop eigenvector and the open loop eigenvector in the sense of the Euclidian standard. 
This projection is done through relations [14.25], [14.26] and [14.27]; 


— proceed by continuity. We can continually move a pole towards the left by pro- 
jecting the corresponding eigenvector. 


A Re) ARe(A) 


+ 
+ 
T 





im@) d mO) g 











Figure 14.6. Shift of a set of poles with minimum dispersion 
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EXAMPLE 14.4. To illustrate these points, let us take a set of models pertaining to the 
lateral side of a jumbo jet (RCAM problem taken from [DOL 97]). The poles of the 


open loop are represented in Figure 14.7. 

















rams T8 -0.6 ae 0 


Figure 14.7. Poles of the open loop of the lateral side of the RCAM 














Figure 14.8. Poles of the closed loop with orthogonal 
projection of the eigenvectors of the open loop 


On the nominal model, we carry out the following placement: 
— 0.23 + 0.59 à — —0.6 + 0.6% 
= 1.30 — —1.30 
— 0.18 — —0.8 


by considering, for each eigenvalue placed, the orthogonal projection of the open 
loop eigenvector (associated to the open loop eigenvalue) on the solution closed loop 
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eigenspace. In practice, for each eigenvalue À; € {—0.23 + 0.597, —1.30, —0.18} 
placed respectively in A! € {—0.6 + 0.67, —1.30, —0.8}, we calculate the sub-space 
V(A:), W(A;) solution of: 


[A- XI B] Pa =0 


Then we place the closed loop eigenvectors v} associated to À! by projection of 
the open loop eigenvectors v; associated to À; on the sub-space (V(X)T,W(X:)T)T. 
This projection is done as follows: 


vi = VADO ADV AD VE (Aus 
w; = WADO ADV AD VE (Au 


Figure 14.8 represents the poles of the closed loop. We can notice that the group of 
poles pertaining to each eigenvalue placed has shifted with a minimum of dispersion 
(the isolated eigenvalues correspond to the eigenvalues not dealt with). 


14.4.3. State feedback and output elementary static feedback 


When even n or p triplets (À;, vi, w;) are placed, it is not necessary to distinguish 
the two syntheses because the procedures are the same. The formula to use in both 
cases is equation [14.16]. Usually, p (or n) triplets are placed. The calculation of K 
is done as follows: 


K = [wy +++ wp) (Clor +++ vp] + Dlr «++ ul) 


If p < n, the n — p are not disturbing if they correspond to dynamics sufficiently 
fast or negligible in the sense of section 14.3. In the contrary case, it is necessary to use 
the exact pole placement techniques or to increase the number of eigenvalues placed 
by using an observer (see section 14.5). 


In the context of our work, the result previously mentioned can be used in order to 
define procedure 14.1. 


PROCEDURE 14.1 (EIGENSTRUCTURE PLACEMENT BY STATE FEEDBACK OR OUT- 
PUT ELEMENTARY FEEDBACK). The procedure is decomposed as follows: 


1) choosing a self-conjugated group of q < p complex numbers À1,..., Ag to 
place as closed loop eigenvalues; 
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2) for each À;, i = 1,...,q, choosing a par of vectors (v;, w;) satisfying [14.15] 
Lei 


[a-ar B] |] =o 
with 0; = v; for i, j, such that À = Aji 


3) finding the real solution of: 
K (Cv; + Dwi) =w; i=1,...,q 


making it possible to place the eigenvalues {1,..., Aq} and the related right eigen- 
vectors {U1,...,Uq}. If the problem is sub-specified (the number q of eigenvalues 
placed is inferior to the number p of outputs of the system), this solution can be 
obtained by a least squares resolution (a; = (Cv; + Dw;)): 


K= [wi,..., Wr] ([or,---, ar] [ar,..-,ar]) [a1,...,a;| 


NOTE 14.1 (GAIN COEFFICIENTS BELONGING TO R). In the second phase, the con- 
dition on the conjugated term is necessary in order to be able to find a real solution. 
This is easily explained by the fact that: 


which can be written (after multiplication on the right by an arbitrary matrix): 
K (CR) Si] + DR(w:) S(wi)]) = Rw) S(wi)] 


where only real numbers are used. The group of linear constraints of the third stage 
can also be written (see [14.5]): 


W = K (CV + DW) [14.28] 


NOTE 14.2 (NON-CONTROLLED EIGENVALUES). Gain K given by this algorithm 
makes it possible to place only p triplets. This method is thus used when the n — p 
other eigenvalues correspond, in open loop, to low controllable modes or outside the 
bandwidth of the corrector, and thus will not be too disturbed by the corrector during 
looping. 


EXAMPLE 14.5. Let us consider again the lateral model of the jumbo jet described 
in [DOL 97]. We are interested in the traditional measurements of 3, p, r and 6. 
In particular we wish to decouple the requests in 3 (respectively ġ) (noted by Ge 
(respectively $.)) of D (respectively (3) (couplings inferior to one degree, for requests 
in 6e and ġe, of two and 20 degrees). These decouplings are illustrated in Figure 14.9. 
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Figure 14.9. Preferred decouplings between B and ġ 








The system has 10 states and six outputs, hence it is possible to place only six 
pairs of eigenvalues and eigenvectors. Based on the relations obtained from the flight 
dynamics, we associate three real number modes to ¢@ (modes £1, £2, €3 associated 
to the eigenvalues À, A2, 43) and a complex number mode as well as a real number 
mode to 8 (modes 4, 67, €5 associated to the eigenvalues A4, AZ, A5). Since the system 
has two inputs, it is possible to impose, for each eigenvalue placed, a decoupling 
constraint. Therefore, the output feedback will be synthesized in such a way that the 
three modes associated with ¢ are each decoupled from ( (first output) and the three 
modes associated with ( are each decoupled from ¢ (fourth output). The eigenvectors 
associated with the eigenvalues of ¢ will thus satisfy: 


A— I Bl} ju! 10 
100000 0 0] jw] J0 
The eigenvectors associated with the eigenvalues of (3 satisfy: 


A-A  B]fv:] [0 
000100 00|{w] lo 


These constraints make it possible to ensure decoupling between modes and out- 
puts. The permanent state decouplings between the settings and outputs are ensured 
by integrators. After calculating the output feedback, we note that the modes dealt 
with are correctly placed and the modes that were not dealt with are fast and not dis- 
turbed by the output feedback gain thus calculated. The responses of outputs 8 and ¢ 
to settings in ĝe and ġe are traced in Figure 14.10 (respectively equal to a two degree 
step function and to a 20 degree step function). 
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Figure 14.10. Decouplings between B and ¢ by output static feedback 


We can notice in this figure that couplings (anti-diagonal faces) remain, in both 
cases, inferior to the degree. Hence, the required decouplings have been correctly 


considered during the synthesis. 


14.5. Eigenstructure placement as observer 


14.5.1. Elementary observers 
A modal approach of the observers’ synthesis is proposed in [MAG 91, MAG 94b, 
MAG 96]. This modal approach is based on the following lemma. 
LEMMA 14.4. The system defined by (see Figure 14.11): 
Ži = m2, — tiy + uw Bu +t;Du [14.29] 
where u; € C”, ti € CP and T; € C satisfy: 
uj A + tiC = Tili [14.30] 
is an observer of the variable z; = u;æ. The observation error is given by €; = 
Ži — uix satisfying: 


Ej, = ME; 
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Demonstration. Based on [14.1] and [14.29] we have: 


Based on [14.30], we have: 











Lan? Lol 
—_—_uB +tD] 


Figure 14.11. Elementary observer of the variable z = uix 














14.5.2. Observer synthesis 


The previous lemma establishes that a linear combination of states u;æ can be 
estimated by a mono-dimensional observer and this observer is obtained with the help 
of vector u; satisfying [14.30] for a given vector t; and a complex number 7. This 
equation can be written (see equation [14.15]): 


[ui ti] à eh =0 [14.31] 


If q elementary observers are used in parallel, it is possible to represent the overall 
observer as in Figure 14.11, but by replacing u;, t; and m; with their matrix notations 
U, T and II where: 


Uy. ty 
U=|: |, T=|:]|, W=Diag{r, --- mm,} [14.32] 
tn 


c 


and where each triplet (;, u;, ti) satisfies [14.29]. These ne equations can overall be 
described as: 


UA+TC =IU [14.33] 


Here, z becomes a vector of size ne. This structure (including the output feedback) 
is described in Figure 14.12. 
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Figure 14.12. Closed loop observer 




















14.5.2.1. Parameterization of elementary observations 


Based on [14.31], vectors u; correspond to elements of the eigen sub-space defined 
by: 


we) Tem] [A | 


Hence, the elementary observers can be parameterized by a vector 77 € CP as 
follows: 


uj = 9; U (Ti) 


14.5.2.2. Choice of elementary observations 
Decoupling 


In order to choose vectors u;, it is possible to consider decoupling objectives (for 
details, see [MAG 91], dual problem of the control issue). 


Projection 


Let us consider A;9 an open loop eigenvalue and u;o its related left eigenvector. It 
is possible to dualize the projection proposed in section 14.4.2. Hence, we obtain: 


ui = mU (T) [14.34] 
ti = mT (mi) [14.35] 

where: 
ni = uio(U (m:)U (m:i) ) tU (7) [14.36] 


The choice of the projection can be justified by considering that u;ọ is exactly 
placed. In this case, the elementary observer that corresponds to this placement is 
characterized by the open loop triplet (Ajo, Uio, tio) With tio = 0. It is obvious that 
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this triplet satisfies [14.31]. The corresponding observer is represented in Figure 14.13. 
From this figure it results that the measurement vector y is not used for the observation 
of the variable z; = u;æ. Consequently, when the increased output feedback is used as 
indicated in Figure 14.12, a dynamic pre-control structure is obtained. More generally, 
when a projection is used, through a continuity argument, the pre-control structure 
becomes “dominant”. The effects of the output feedback are thus minimized. 


Sen) ef Ci 


Figure 14.13. Elementary observer of 
z = u;x where u; is a left eigenvector of A 

















14.5.3. Synthesis of output dynamic feedback in the form of observer 


When a set of elementary observers is considered, the “measurements” available 
to us are provided by y and 2. The problem of synthesis supposes seeking matrices 
K, and K, so that the system: 


z= Ax + Bu 
ż = I + (UB + TD)u — Ty [14.37] 
y = Cx + Du 





corrected by: 
u = Kyy + K,2 


has the dynamics hoped for. Due to the separation principle, this synthesis is usually 
divided into two sub-problems. 


The separation principle, mentioned below, establishes that the syntheses of the 
observer and the output feedback can be done independently: 


1) the observer is synthesized by the choice of matrices U, T, II according to 
equation [14.33]; 
2) instead of synthesizing the output feedback as u = Kyy + K,zZ specific to 


system [14.37], we synthesize it as u = Kyy + K,z (it should be noted that the 
estimation Z of z is replaced by signal z) of the following system: 


x = Ax + Bu 
y C D [14.38] 
= e+ u 
zZ U 0 

















472 Analysis and Control of Linear Systems 


THEOREM 14.1 (SEPARATION PRINCIPLE). Let us consider that: 


1) an observer of order ne is synthesized, i.e. that three matrices U € R":*", 
T € R"<*? and I € R"°*"¢ satisfying [14.33] are synthesized; 


2) two gain matrices Ky € R"*" and K, € R™*"« specific to [14.38] are 
calculated, thus making it possible to place p + nc eigenvalues and eigenvectors. 


Then, if U, T, IL Ky and K, are used based on Figure 14.12, the corresponding 
closed loop system is such that: 


1) the eigenvalues of II belong to the spectrum of the closed loop system; 


2) the p+ ne eigenvalues of the system [14.38] corrected by K, and K, belong to 
the spectrum of the closed loop. 


Demonstration. After simplifying the expression of Z (see [14.37]): 
z= Ax + Bu 
2 = I2 + UBu — TCgz 
y = Cx + Du 


Let us consider the feedback in the conceivable form (2 replaces z) u = Kyy + 
KZ. This will control the increased system: 




















x A 0 x B 
gil = u 
Zz -TC Ill |z UB 
y| |C Ojj D 
Hoe zit lol" 




















zZ 


by the feedback u = K|} ] where: 
K =(I-K,D)' |K, K] 


The equation of the closed loop system is thus: 


i-e dae DE m 


Let us use the estimation error variable € = z — U x by considering the following 
variable change: 


el-e dEl + El- zil 
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System [14.39] becomes, by using the equality UA + TC = IIU: 
£ A 0 B| ~|C 0 x 
AE elei 41) 


|- — g ex E [1440] 


or: 


The block triangular structure demonstrates the lemma: the closed loop spectrum 
will consist of the observer spectrum and the system spectrum [14.38] looped accord- 
ing to u = Kyy + Kzz. 














Let us consider the problem of the equivalent output feedback described by the 
system [14.38]. Procedure 14.1 can be applied to this system (see procedure 14.2). 
The supplementary rows of the “new matrix C” of the system make it possible to 
place p + ne eigenvectors instead of p (if ne + p > n, n eigenvalues and eigen- 
vectors are placed). The separation principle makes it possible to consider the loop- 
ing u = Kyy + K,z for the synthesis, whereas for the implementation we use 
u = Kyy + K,Z. For example, we could use this type of an observer (by choos- 
ing an order equal to n — p) in order to place all the eigenvalues of the system (but it 
is preferable to place only the dominant modes) or to artificially increase the number 
of outputs in order to deal with robustness problems (see [MAG 97]). 


NOTE 14.3 (NON-CONTROLLABILITY OF OBSERVER MODES). Based on equation 
[14.40] we note that the left eigenvectors associated to modes 7; of the system looped 
by the observer have the form [0 uci] (for a certain vector uci of size ne). Conse- 
quently, the product “left eigenvector by matrix B” is zero ([0 ui] [BT O]7 = 0), 
which immediately translates the non-controllability. 


Now we will sum up the above discussions. 


PROCEDURE 14.2 (EIGENSTRUCTURE PLACEMENT THROUGH AN OBSERVER). Let 
us suppose that we want to place q (dominant) poles and that q > p. The procedure is 
divided as follows: 


1) choosing matrices U, T, II satisfying equation [14.33]. The size ne of these 
matrices is equal to the size of the dynamic extension used for placing the q — p 
pairs of eigenvalues and eigenvectors that cannot be placed by output static feedback, 
ie. ne = q — p (in the case of placement of all eigenvalues of the system, we have 
Ne =p- Qs 

2) choosing a self-conjugated set of q complex numbers À1,..., Aq to place as 
closed loop eigenvalues; 
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3) for each À;, i = 1,...,q, choosing a pair of vectors (v;, w;) satisfying [14.15], 
i.e.: 


[A-I B] E =0 


Wi 


with v; = v; for i, j such that À = Aj; 


4) finding the real number solution of: 


[Ke Kz] G vi + a w) =w; i=1,...,q [14.41] 


If the problem is sub-specified (q < ne + p), this solution can be obtained in the 
least squares sense: 


[Ke Ka] = fs ET TT 
where: 


_ |Cv + Dwi +--+ Cv, + Dwg 


$ Uv: z Uv 


NOTE 14.4 (EQUIVALENT DYNAMIC EQUALIZER). We can show that the feedback 
calculated by the above procedure corresponds to the dynamic gain: 


Ac =I + (UB +TD)K, 
B: = -T + (UB +TD)K. 


[14.42] 
C= K; 
De = Ke 
and to the dynamic pre-control: 
Ap = Ac 
B, =UB+TD 
arc [14.43] 
Cp = Ce 
D,, =1I 


NOTE 14.5 (EFFECT OF THE EIGENSTRUCTURE OF THE OBSERVER). Based on the 
demonstration of the separation principle, the dynamics of the observer is non-control- 
lable. Let us consider & = Ax + B(u + v). It is possible to show that: 


t 
T| ontq Vi Nu(t—T)I,,. B 
H = à; I | e [ui uci] el v(rT) dr [14.44] 
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Note 14.3 underlines the fact that the non-controllability of the observer’s modes 
implies u;B+u-;U B = 0; hence, the term corresponding to 7; in [14.44] disappears. 
The eigenstructure of the observer does not have any effect on that of the closed loop 
system. 


EXAMPLE 14.6. Let us take again Example 14.4. We consider here that the only avail- 
able outputs are the measurements of the slide-slip angle 6 and of the bank angle ¢. 
These two measurements enable us to place only two pairs of eigenvalues and eigen- 
vectors through the output static feedback. That is the reason why we will use the 
procedure of eigenstructure placement in the form of an observer. This eigenstruc- 
ture — by choosing an observer of a sufficient order — will enable us to set a part or all 
the dynamics of the system. 


The system considered has p = 2 outputs and we will place the same eigenvalues 
as in Example 14.4, i.e. q = 4 eigenvalues. The observer necessary to such a placement 
must thus be of order ne = q — p = 2. We choose the dynamics of the observer 
according to objectives specific to the bandwidth of the corrector. For example: 


1.2 = —0.8 + 0.82 





Matrices U and T are obtained through the equation of Example 14.4 by orthogo- 
nal projection. 


We will place the same eigenvalues as previously, 1.e.: 


— 0.2360 + 0.59547 — —0.6 + 0.62 
— 1.3017 — —1.317 
— 0.1837 — —0.8 


with the same decoupling constraints. 


We calculate matrices K, and Ke of the observer by resolving a set of four equa- 
tions similar to [14.41]. 


Finally, in the spectrum of the closed loop, we find the modes placed by the correc- 
tor in the form of observer as well as the modes of the observer itself (separation prin- 
ciple). The non-controlled modes have a correct behavior. We note that these modes 
could have been placed in the same way by increasing the size of the observer. Fig- 
ure 14.14 shows the four modal simulations between 8e, ®. and 3, ®. We note that 
the decouplings were taken into consideration. 
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Figure 14.14. Inputs-outputs modal simulation 


14.6. Conclusion 


In this chapter, we have defined and studied the modal behavior of a system. We 
have seen how to choose the eigenvalues and eigenvectors of the closed loop according 
to the objectives: 


— on the dynamics of time responses; 
— on the couplings-inputs, outputs or modes-outputs; 


— on the local robustness to parametric variations. 


Once this choice is made, we have seen how to synthesize a static corrector making 
the placement of this eigenstructure possible. In the case of the output feedback, it 
is possible to place p pairs of eigenvalues and eigenvectors with a constant gain. In 
order to remedy this inconvenient, we have presented the synthesis of observers and 
the technique that makes it possible to directly carry out an eigenstructure placement 
in the form of an observer. Therefore, it is possible to place the totality of pairs of 
eigenvalues and eigenvectors of the system. 
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As we have said in our introduction, our intention was to describe here a synthesis 
technique in which we choose a priori the eigenvalues and eigenvectors of the closed 
loop, which leads us to a linear problem. Other techniques do not make this choice and 
adopt other policies calling upon more complex resolutions or non-convex optimiza- 
tion problems. Those interested in finding out more may refer to [APK 89, CHO 94, 
MUD 88]. 


Finally, the traditional approaches that we described are the basis of more complete 
syntheses (detailed in [MAG 02b]), that are meant, in particular, to deal with problems 
of parametric robustness. 
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Chapter 15 


Robust H../LMI Control 


The synthesis of a control law passes through the utilization of patterns which 
are nothing other than an imperfect representation of reality: besides the fact that the 
laws of physics provide only a global representation of phenomena, valid only in a 
certain range, there are always the uncertainties of pattern establishment because the 
behavior of the physical process cannot be exactly described using a mathematical 
pattern. 


Even if we work with patterns whose validity is limited, we have to take into 
account the robustness of the control law, i.e. we have to be able to guarantee not 
only the stability but also certain performances related to incertitude patterns. This 
last issue requires completing the pattern establishment work with a precise 
description of pattern uncertainties, to include them in a general formalism enabling 
us to take them into account and to reach certain conclusions. 


The synthesis of a control law is hence articulated around two stages which are 
being alternatively repeated until the designer reaches satisfactory results: 


— controller calculation: during this stage only certain performance objectives 
and certain robustness objectives can be taken into account; 


— analysis of the controlled system properties, from the perspective of its 
performances as well as their robustness. 


Chapter written by Gilles DUC. 
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The approaches presented in this chapter are articulated around these two 
concepts. 


15.1. The H„ approach 


The preoccupation for robustness, which is inherent among the methods used by 
traditional automatic control engineering, reappears around the end of the 1970s 
after having been so widely obscured due to the development of state methods. It is 
at the root of the development of H approaches. 


15.1.1. The H~ standard problem 


Within this approach, the designer considers a synthesis scheme whose general 
form is presented in Figure 15.1: vector u represents the controls and vector y the 
available measurements; vector w reunites the considered exterior inputs (i.e. 
reference signals, disturbances, noises), which can be the inputs of the shaper filters 
chosen by the designer. Finally, vector e reunites the signals chosen to characterize 
the good functioning of the feedback control system, which are generally obtained 
from the signals existing in the feedback control loop with the help of the filters 
chosen there also by the designer. 





Figure 15.1. H, standard problem 


The objective of the problem considered is thus to determine a corrector K(s) 
that ensures the stability of the closed loop control system in Figure 15.1, conferring 
to the transfer T,„(s) between w and e a norm H. less than a given level y. This 
can be defined as follows: 
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| Tew (9) ho = sup YA Coy GO) Tey jo)" ) 
oeR 


i [15.1] 
= sup 2-0 Ty (jo) 
we R 





where À designates the highest eigenvalue. 


Let us suppose that the level y has been reached. Then, by using the properties of 
norm H,, [DUC 99], we can establish that: 


— each transfer T,,, (s) between a component w; irrespective of w and a 
component e; irrespective of e verifies: 


VoeR 





Tow, J w) <y [15.2] 


— the system remains stable for any uncertainty of the pattern that would 
introduce a looping of e over w in the form w(s) = A(s) e(s), A(s) being a stable 
transfer matrix irrespective of the norm H less than 1/7. 


We can therefore use these results in different manners: 


— to impose templates to certain transfers by choosing the signals e and w, in 
an appropriate manner; if, for example, e(s) = Wi (s) z(s), where z is the output to 
be controlled and w is a disturbance, we obtain: 


vøeR |T,,(jw) < [15.3] 


Wigo) 


so that the filter W; (s) makes it possible to impose a template to the transfer T.,,(s) 
between the disturbance and the output; 


— to perform the synthesis of a corrector which ensures the robustness related to 
the incertitude of A(s) pattern marked by norm (in this case, the signals e and 
w do not correspond to the feedback control inputs and outputs but they are the 
results of an appropriated pattern establishment); 


— to adopt a combination of these two approaches. 
It is worth mentioning that, historically, the second approach is the root of the 


H œ syntheses development and gathering all the patterns uncertainties in a single 
transfer matrix A(s) is a very poor representation which leads in most of the 
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practical cases to limited results. The synthesis H. must then be seen, according to 
the first approach, as a way to impose templates to nominal patterns of the feedback 
control without being able to take into account all the robustness objectives from the 
synthesis. 


15.1.2. Example 


Let us consider a system with the input y and the control u, where the nominal 
pattern is: 


1 
Y(s) = G(s) U(s) = once [15.4] 


We want to create a feedback control in accordance with the block diagram in 
Figure 15.2, where the corrector K(s) must ensure the following objectives: 


i) the output y must be controlled over a constant reference r, with a static error 
less than 0.01; 


ii) the gain of the feedback control! must contain all the angular frequencies 
between 0 and 1 rd/s at least; 


iii) the module gain? must be at least equal to 0.7; 


iv) the gain of the transfer function between r and u must be less than 10 for all 
angular frequencies and it must decrease following a gradient of —20 dB/decade 
beyond 10 rd/s; 


v) the gain of the transfer function between r and y must be less than 0.5 beyond 


10 rd/s. 
RQ “ef Ki) | [Gy FE 


Figure 15.2. Block diagram of the feedback control 











1 Conventionally defined as the set of angular frequencies for which the gain between the 
reference r and the error is less than 1. 

2 Defined as the minimum distance between a point of Nyquist plot of the equalized system 
and the critical point —1. 
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Points i) to ili) can be translated through stresses on the transfer function 
Tey (8) =(1+G(s) K(s))~', where the gain must be: 


— less than 0.01 in steady regime; 
— less than 1 below 1 rd/s; 
— less than 1/0.7 above. 


Point iv) explicitly concerns the transfer T,,, (s) = K(s) 1+G(s) K(s)) ao 
Finally, point v) concerns the transfer T, (s) = G(s) K(s) (1 + G(s) K(s))! ; 


This brings us to construct the block scheme in Figure 15.3, where the filters 
W;(s) are chosen in accordance to these specifications. 


voj 


e2 








Figure 15.3. Diagram used by the synthesis 





1 s+0.01)1 0.7s+1 
0.7 s+1/0.7)  s+0.01 


mo- 





—1 
10 131000 = 19 5410 15.5] 


W- mas 
26) | 1+5/10 5+1,000 





-1 
s+10 2s 
W3(s) =| 0.5 = 
3(s) | kY ) s+10 


It must be noted that denominator W, (s) does not result from specifications but 
it is introduced in order to make this filter an eigenfilter: this condition is generally 
required by the resolution algorithms. 
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The scheme in Figure 15.3 is presented in the general form in Figure 15.1 by 
choosing w=r, y=€ and e=(e, e ez) . We are then going to search for a 
corrector K(s) solution of the following problem: 


Wi(s) Ta (s) 
W (8) Tr) |] <7 [15.6] 
W3(s) T (s) 


If this problem accepts a solution, we shall then have: 
; saxa 2 a A Sales ne 
Vo PGO) Ta Go} +P Go) Ty GO|” + MGO) Ty GO) <y 
which implies: 


Y 


W, GO) Tz (jo) <y & Tal jo) < —— 
Poa Fa ga 


; ; à Y 
Vo W. (ja) Tur (JO) Sf os Tur (jo) ee [15.7] 
i | | | mgo) 
y 


Mga) T Go) < 7 e |, Go) < meal 


so that the objectives will be reached if the value of y is less than 1 (or at the most 
close to 1). 


By applying one of the resolution methods which are to be subsequently 
presented, we obtain a corrector corresponding to the value y=1.029 whose 
equation is the following, after an order reduction that makes it possible to eliminate 
the useless terms (a pole and a zero in high frequency and an almost exact 
compensation between a pole and a zero): 


(s+ 1)(s +2) 
(s + 0.01)(s? +15.75 + 73) 





K(s)=71 [15.8] 
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The transfer functions obtained for the feedback control are written: 


_ (8 +0.01)(s? +15.75 +73) 








Ter(s)== > 
s~ +15.7s° +73.2s +71.7 
s+1)(s +2 
T (S) = 5 ( A ) [15.9] 
s~ +15.7s^ +73.2s +71.7 
71 
Te, (8) = 


s? +15.757 +73.2s +71.7 


Figure 15.4 shows the Bode diagram for each of these functions compared to that 
of the inverse of the filters: it makes it possible to verify that the inequalities [15.7] 
are satisfied and hence that the synthesis objectives are reached. 


In terms of robustness, the last of the inequalities [15.7] introduces a bound over 
the transfer bandwidth between the reference and the regulated magnitude: this 
ensures that the closed loop control system can tolerate high frequency dynamics 
which are not taken into account by the pattern [15.7] without risk for stability. In 
order to illustrate this idea, we suppose as an example that the pattern [15.7] does 
not consider an additional first order term at the denominator, so that a more precise 
pattern would be: 


ENE 1 
neo) te 





Acknowledging that: 





TS 


G(s) = Gs) h- ue | [15.11] 
+ 


The closed loop control system is presented in Figure 15.5a, which is equivalent 
to that in Figure 15.5b. In this latter figure, the transfer from r’ to y” verifies the 
third inequality [15.7]: 


K(j@)G(ja) 
1+ K(ja)G(Ujo) 


yY 
W3(ja)| 





Vo Ir, Go) = [15.12] 
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Figure 15.4. Bode diagrams for different transfers (full lines) 
and for their templates (dotted lines) 


We therefore infer that the closed loop control system in Figure 15.5b is stable 
for any value of 7 such that: 
; T jo Tjo 
Tyy (jo) —— 
1+7tjo 





m (Go) 
< 
Y 


Vo <1 oe Vo [15.13] 














1+7 jo 


because Figure 15.5b then corresponds to a system where the open loop (in y”) is 
stable, with a gain always less than 1: from Nyquist criterion, the close loop is then 
also stable. 
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Figure 15.5. Study of the neglected dynamics robustness 


Figure 15.6 makes it possible to compare the two functions’ Bode diagrams 
which appear in the second inequality [15.13] (W3 /y with full line and graphs for 
three different values of 7 in dotted line): we see that stability is ensured for any 
value of 7 less than 0.2. 
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Figure 15.6. Determination of a bound value of the neglected time constant 
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15.1.3. Resolution methods 


We can consider different methods in order to solve the H. standard problem. 
We therefore present the approach through the Riccati equations and the approach 
through Linear Matrix Inequalities (LMI), which are the most widely used. 


These two methods use a state representation of the interconnection matrix P(s) 
which is written in the following form: 


x(t) A | By Bu, x(t) 
e(t) |=| Ce | Dew Dey || WO [15.14] 
y(t) Cy |Dyw Dyu u(t) 





n, 
with xe R”; we R™ sue R™; ee R™; ye R”. 


15.1.4. Resolution of H. standard problem through the Riccati equations 


To solve the H. standard problem, we suppose the following hypotheses as 
being satisfied: 


H1) (A, B,,) can be stabilized and (Cy, A) can be detected; 


H2) rank(D,,)=n, and rank (Dw) =n; 


H3) VaeR rank 
D 


A-jal, a 
=n+n,; 


e eu 


Ha) Vee R tom 2 
) Voe ra c D =n+n,. 


y yw 


From a practical point of view, hypothesis H1 forces the user to choose the stable 
filters W;(s) : placed outside the loop, these are actually non-controllable by u and 
non-observable by €. In order to be verified, hypothesis H2 supposes the presence 
of direct transmissions between the controls u and the regulated variables e on the 
one hand, and between external inputs w and the measures y on the other hand. 
Hypotheses H3 and H4 are verified when the transfers P,,,(s) and Py (5) are not 
zero on the imaginary axis. 
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We present below the solution of a simplified case, which is characterized by the 
following relations: 


E 
Dey =0 Deu (Ce Den) = (0 In, ) 


[15.15] 
me (jet 
Dyw In, 


The general case is presented in [GLO 88]. We can also bring this general case 
back to the simplified one by using variable changes [ZHO 96]. 


The following theorem makes it possible in the first place to test the feasibility of 
the standard problem. 


THEOREM 15.1.— having the hypotheses H1-H4 and the conditions [15.15], the 
H „~ standard problem has a solution if and only if the following 5 conditions are 
satisfied: 


A  y°B,B} -B,BT 


i) matrix H, = 
f C1 C, -AT 


| has no eigenvalue on the 
imaginary axis; 


ii) there is a symmetrical matrix X ,, 2 0 solution of the Riccati equation: 


X A+A Xo + Xo (YB, By! —B,B,' )X0+C,'C, =0 [15.16] 


aA. -F?ClC-2 CG 


iti) matrix Jo. = A Y | has no eigenvalue on the 
- BB, -A 


imaginary axis; 


iv) there is a symmetrical matrix Y,, 20 solution of the Riccati equation: 
YAT +A + (V° CE Co -C Ce + By By =0 [15.17] 


v) p (X.Y) < y? 


where p ( ) designates the module of the highest eigenvalue. E 
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Finally, a solution for the standard problem is given by the following theorem. 


THEOREM 15.2.— based on the conditions of Theorem 15.1, a corrector K(s) 
stabilizing the system and accomplishing | Toy(s) l <y is described by the state 
representation [15.18]: 
T 
Fe e 


eo ee 
u(t) = B! Xo 0 y(t) 


4 —2 T T T 
À. = A+y By By, Xoo B, By Xe — Zoo Yoo Cy C [15.18] 





Zo = (Ip -y TX.) | = 


Thus the application of this solution consists of using firstly the results of 
Theorem 15.1 to find an admissible value of y (we can use iterations on y by 
exploring through dichotomy a range of values previously chosen). Afterwards, we 
calculate a corrector by applying Theorem 15.2. 


15.1.5. Resolution of the H. standard problem by LMI 


Synthesis by LMI provides another way to solve the standard problem. It is more 
general, since it requires only the hypothesis H1. We shall limit the exposition to the 
case when the condition [15.19] is verified: 


Dy, =0 [15.19] 


In the opposite case, we firstly solve the problem by considering fictional 
measure units p corresponding to this case and we modify a posteriori the corrector 
obtained by carrying out the change of the variable y = Du within its state 
equations. 


The feasibility of the standard problem is tested using the following theorem 
[GAH 94]. 
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THEOREM 15.3— having the hypothesis H1 and condition [15.19], the problem 
H,, standard has a solution if and only if there are 2 symmetric matrices R and S, 
verifying the following 3 matrix inequalities: 


AR+RA RC] B 


w NS 0 
CR -yl De < 0 [15.20a] 
By Dew -71 n, É 


E T 
Ne oV [45454 SB Ce | Ng o 
T 


À ES fl. D > ok | < 0 [15.20b] 
n 
Ce Dew = Yln, . 
R I 
al a) [15.20c] 
be S 


where Nr and Ng form a core basis of (Be Den’ ) and (Cy Dyw), respectively. 


Additionally, the r<n order correctors exist if and only if the inequalities 
[15.20a, b, c] are verified by the matrices R and S which satisfy the additional 
condition: 


R I 
rank lamer > rank(1,-RS)<r [15.20d] M 


n 


The matrix inequalities [15.20a, b, c], which replace Theorem 15.1 conditions 
from i) to v), are closely connected to the unknown parameters R and S: they are 
usually designated by LMI. It is easy to verify that the set of matrices satisfying one 
or several LMIs is a convex set. Specific solvers are dedicated to this kind of 
problems [GAH 95]. 
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We can additionally seek the optimal value of y by solving the following 
problem, which is a convex optimization problem: 


min y under [15.20a, b, c] [15.21] 
R=R", S=S" 


From the solutions of matrices R and S in the previous problems, we can 
consider various procedures to form a corrector: explicitly formulae are given 
especially in [IWA 94], whereas [GAH 94] proposes a resolution by LMI, which can 
be summed up as follows. 


Let: 


[15.22] 


X(t) = AcXe (0) + Bey) 
u(t) = Cx, (t) + Dy) 


with x. € R” being a state representation of the corrector of order r<n sought. 
The closed loop control system in Figure 15.1 has as a state representation: 





x At B ,DeCy BC | But Bp BD; fx rea x 

. ENTS 

že |= B.C, Ae Be Dyw Xe |= E A že [15.23] 
e Cet DeuDeCy DeuCe | Dewt Deu De Dyw | (w i “7 \w 


and, based on the “Bounded Real Lemma” [BOY 94], its norm H is less than y if 
and only if there is a matrix X = X T > 0 that verifies: 


T T 
BX -yl D |<0 [15.24a] 
Cr Dp -Yh, 


(which is a bilinear matrix inequality in X ,A.,B,,C.,D,). A suitable matrix X 
can be obtained by performing a decomposition into singular values of /,—-RS, 
from where we can infer 2 full rank matrices M,N e R"*" verifying: 


MNT=1I,-RS [15.24b] 
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which make it possible to determine: 


gah? : [15.24c] 
= x C 
NT -MRN 


where M* designates the pseudo-reciprocal of M (M*ŸM=I,). The inequality 
[15.24a] is therefore an LMI in Ao, Be, Ce, De, where the resolution then provides 
a corrector. 


15.1.6. Restricted synthesis on the corrector order 


The two resolution methods presented in the previous sections lead to correctors 
with an order equal to that of the matrix P(s), which contains the pattern of the 
regulation system increased by the filters expressing the synthesis objectives. 
However, we easily understand that this order, which can be very high, is not 
inevitably necessary to obtain a satisfactory control policy. 


The LMI formulation makes it possible to consider the synthesis H. with a 
restricted order. Let r <n be the order of the corrector sought. It is necessary to 
establish matrices R and S, which are solutions of LMI [15.20a, b, c] and satisfying 
at the same time the restriction [15.20d] (about which we can say that it is always 
verified for r 2 n): this restriction leads to the loss of convexity of the set of 
matrices solutions, but heuristic methods dedicated to this type of problem can be 
efficiently used [DAV 94, ELG 97, VAL 99]. 


15.2. The -analysis 


The seanalysis is a technique which makes it possible to study system properties 
in the presence of different uncertainties of the pattern establishment. It should be 
noted that it is no longer a matter of calculating a corrector but, a corrector being 
given, it is about characterizing the robustness it provides to the closed loop control 
system. This technique, which appeared at the beginning of the 1980s, represented a 
major progress, perceptible especially through the change in the judging manner: it 
enables in fact the description and analysis of the properties on a patterns family and 
no longer on an unique pattern about which we know that it is not capable to 
represent the set of possible behaviors of a process. 
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15.2.1. Analysis diagram and structured single value 


The u -analysis uses the general diagram in Figure 15.7 (where we can observe 
the relationship with the one used in synthesis H): all the pattern uncertainties are 
reunited in the matrix A(s); the transfer matrix H(s)— which, in the case of a 
feedback system obviously depends on the corrector — establishes a pattern for the 
interconnections between the inputs w, the objectives e and the signals v and z which 
make the uncertainties possible. 


If the transfer matrix H(s) can be anything, the situation is not the same for the 
matrix A(s), which generally has a particular structure. Typically, this matrix will 
be block diagonal and consist of, on the one hand real diagonals blocks (representing 
the parametrical uncertainties) and on the other hand, transfer functions (or matrices) 
(representing neglected or uncertain dynamic phenomena): 


A(s) = diag (1, … , 6,1,, Ay(s), … » Ag(s)} 


TT. 


[15.25] 
ER; A;(s)e RHE 


where RH 2" conventionally designates the set of stable transfer matrices of size 
nxn. Further on, we shall name S the set of all complex matrices having size and 
structure identical to those of A(s) : 


S = {A = diag 1517, ,6,1,, Ats Ag ff 
6;€R ; Ae O 


oa 
TO, 


e 


[15.26] 








Figure 15.7. Robustness analysis diagram 


In other terms, A(s) € S for all the values of s. 
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Let M be a square complex matrix having the same size as A(s) . We note by A 
the transpose-conjugated of A. The structured single value of M, of the set S, is 
defined by: 


—1 
M):=| inf AAA): det (1-A M) = 0) 
Us (M) Fins, | et ( ) as 


:=0 if YAE S det (1-AM)#0 


15.2.2. Main results of robustness 


The structured single value makes it possible to establish different results 
[ZHO 96]. Further on, we divide the transfer matrix H(s) in Figure 15.7 into: 


ee) G a an kA [15.28] 
e(s)) (Has) Ao2(s)) ws) 
with dim(z) = dim(v) = n; and dim(e) = dim(w) = n3. 


THEOREM 15.4— if H(s)is stable, the system in Figure 15.7 is stable for any 
matrix A(s) of type [15.25] so that | A(s) l <1/@ if and only if: 


voeR, us(H,,(Go))< @ [15.29] 


If, in addition, | H (s) l < $, the system in Figure 15.7 has a norm H less 
than B for any matrix A(s) of type [15.25] so that | A(s) I <1/ £ if and only if: 





VaeER, us (H(ja))< B [15.30] 


where S' is obtained by completing S by any complex matrices having the same 
size as H (s): 


S'={A’ = diag{A , Ay}; AES; Ane C"} [15.31] M 





The first result of Theorem 15.4 is clearly a result of the stability robustness with 
pattern establishment uncertainties. The second one is the result of performance 
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robustness because it guarantees that each transfer function 7,,,, (s) has a gain less 
à KGI 
than 1 for all frequencies. 


15.2.3. Example 


We consider the closed loop control system in Figure 15.8, with a constant 
corrector K(s)=2. The system to be controlled is characterized by the transfer 
function G(s) , whose nominal expression is: 


1 
G(s) = —; 1<a<3 [15.32] 
(s +a 








Figure 15.8. Studied system 


In addition, this pattern neglects the high frequency dynamics, which are 
globally represented by a time constant with a maximal value equal to 0.5 s. 


To characterize the parametrical uncertainty on the transfer function G(s), we 
firstly suppose that a = 2 + ô, with —1 < ô <1. The transfer function corresponds 
then to the differential equations: 


d 
el a +0 y, =u 
_ [15.33] 
+ +2y2 +d. =y) 
dt 
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(having for instant y, = y ). In order to separate the uncertainty 5 from the rest of 
the system, in accordance with the general diagram in Figure 15.7, we have: 


or [15.34] 
Z2 =Y35 V2 = 02 
Equations [15.33] are then written: 

d 

ae +2y; = —vı +u 

t 
15.35 
di [ ] 


—— + 2y = -v + y1 
dt 


In order to represent the neglected dynamics, by reiterating the approach 
presented in section 15.1.2, we note that a possible pattern of the system is the 
following: 





G(s) = : z F h- 2 | [15.36] 


(s+a)P{i+rs) (sta 1+7s 


with 0<7<0.5. We can contain this type of patterns within the set of transfer 
functions in the following form: 


G(s) = ee +Wa(s) Ag(s)) [15.37] 
(s + a) 


where the filter Wy (s) is chosen according to the previous knowledge of neglected 
dynamics and where A,(s) is a restricted norm stable transfer function: 


0.55 
1+0.5s 


JA; (5). = sup A; (jæ) <1 
w 





W3 (s) = 
[15.38] 


By reuniting these two pattern establishments, we can redraw the block diagram 
of the closed loop control system in the form given in Figure 15.9 (always with 
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K(s) = 2). We easily identify the matrices A(s) and H(s) of the general diagram 


in Figure 15.7 by considering v = (v; v3 vg yr and z = (2 22 Zg LE 








50 0 
A(s)=|0 5 0 [15.39] 
0 0 A(s) 
—(s +2) 2 -2s jst+2 
=D | 
i -1  =(s+2) +. 
H(s) = =—— a ! [15.40] 
s” +4s+6 =] = (s +2) LJ 
S ES OR 
-1 =(s+2) s(s+2)! 1 





























> 
Figure 15.9. Diagram used for the robustness analysis 
We can verify that H(s) corresponds to a stable system and that: 
laa = up (Ha) = 1/6 [15.41] 
In accordance with the above results, we define the sets S and S' by: 
S = {A =diag{51,,A ,};6€R;A eC} [15.42a] 


S'={A’ = diag{ô 1, ,Ay,An}:6eR;A/;eC:A»eC} [1542b] 
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Figure 15.10 shows an upper bound of us(H;,(j@)) according to œ (obtained 
following an approach which will be presented in the next section). Its value remains 
less than 0.7 for any ©. We infer from this that the closed loop control system is 
stable for any A(s) with the structure [15.39] such that: 


|5|<1/0.7 


JA, CS). <1/0.7 [15.43] 


Aol <1/0.7 <= | 


The first condition is equivalent to a condition on a, which is presented below. 
From the second condition we can infer a maximal value for 7 by noticing that: 


























: | 1 | jo 
A <1/0.7 & Vo W,(jo)A,Uo)|<— 15.44 
Aas). iGO M00 < Ta [15.44] 
If we apply this inequality to our particular case, i.e.: 
-T 
Wals) Aa (s) = —— [15.45] 
Ts+l 
we infer from it a maximal admissible value for 7 : 
2 | L2] e |r|<0.5/0.7 [15.46] 
jæ+1/t| 0.7|jæ+2 
We can finally state that the closed loop control system is stable if: 
0.572 < a < 3.428 
7 [15.47] 
0<7<0.714 


Figure 15.10b shows an upper bound of lg (H ( j@)) according to @. Its value 
remains less than 0.89 for any w. We infer from here that the closed loop control 
system preserves a norm H. less than 0.89 for any A(s) of structure [15.39] such 
that JA(s)|. <1/0.89 =1.12, which is a condition accomplished for: 


0.88<a<3.12 
| ae [15.48] 


0<7<0.56 
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Figure 15.10. Upper bounds of the structured single value 


15.2.4. Evaluation of structured single value 


The calculation of the structured single value is recognized since the beginning 
as a difficult mathematical problem (except for simple cases, it is part of the 
problems with non-polynomial complexity). Nevertheless, we know how to search 
various upper bounds by solving optimization problems under LMI constraints. This 
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means that the following sets of matrices, whose structure is inferred from that of 
the set S: 


D = diag \D,,--,D,, AL) 05d gl 
D= (Di Dys dil aln J [15.49a] 
D,e C" ; deR 
D ={DeD; D=D*> 0} [15.49b] 
H 
G = diag \ G}; G, O, =, 0 
G= l à% D n) [15.49c] 
G; € Cc" n x Gi =G; 


The set D consists of matrices which can be substituted by any matrix of S: 
DA = AD; the set Dy is formed of Hermitian matrices positively defined by D; the 
set G is formed of Hermitian matrices (not necessarily defined) such that for any 
matrix of S: GA =A*G: in fact the single non-zero blocks of matrices G 
correspond to real blocks of A. 


We then demonstrate the following results [YOU 95, ZHO 96]. A first upper 
bound is obtained by using only the matrices D: 


us(M)< y = min J21D 'M*D?M D! [15.50a] 
S 1 D 


DE 


This first upper bound can be calculated by solving the following optimization 
problem: 


y = min y,; = min Y under the constraints: [15.50b] 
De D De Dy 


y 20 [15.50c] 


M*DM-%°D<0 [15.50d] 
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If the matrices A(s) contain real blocks, a more precise upper bound is obtained 
by using in conjunction the matrices D and G: 


us(M)< y,” with [15.51a] 
Yo = min 7, = min y, under the constraints: [15.51b] 
DeD De D, 


GeG GeG 


y >0 [15.51c] 


M*DM + j(GM-M'G)-y"D<0 [15.51d] 


The interest of these formulations is that, with fixed y, and y3, the inequalities 
[15.50d] and [15.51d] are LMIs, in D or in D and G respectively. The calculation of 
Yi” and y2” can be performed by using regulators dedicated to this type of problem 
[BAL 93, GAH 95]. 


The approach usually used to perform a 4#-analysis consists of searching an upper 
bound of us (H 11( j®)) or Ug (H ( jæ)) for a previously chosen set of values for 
æ . If these functions are regular enough (which unfortunately is not always the case 
[PAC 93]), we obtain quite easily upper bounds of their maximum: based on the 
results in section 15.2.2, this is the information we need in order to conclude on the 
issues related to stability robustness or the performances robustness. This approach 
has been used in the example of section 15.2.3. 


15.3. The u-synthesis 
15.3.1. A H „robust synthesis 

In an interesting manner, the y-synthesis combines the two previous approaches 
by searching an answer to the following problem: can we determine a corrector 
which guarantees that the norm H. of a closed loop control system remains less 


than a given y level, this system being submitted to different pattern uncertainties? 


In order to approach this problem, we consider the block diagram in Figure 
15.11, which combines the diagram of H „ standard synthesis (Figure 15.1) and that 
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of u-analysis (Figure 15.7). As in the previous section, the uncertainties A(s) have 
the general structure: 


A(s) = diag [81], , 6,1, , A(S), ». Ag(s)f 














[15.52] 
ô; € R 5 A;(s)e RHE” 
z] AG) vV 
ety E E cee E a ye ee L] 
Pe <q 
e P(s) |}~<—1—— w 
S i 
i i 
1 LV uj: 
K(s) ~ | 
Figure 15.11. The problem of robust synthesis 
and we shall suppose that each uncertainty has a norm bounded by 1: 
6,¢€|-1;41|,7=1,r À l eu 
JA) birg 7 BOL < es 


Moreover, by supposing that the level y to be satisfied is equal to 1 (we can 
always return to this case by integrating the value of y in the matrix P(s)), the 
problem is the following: to establish a corrector K(s) so that the norm H. of the 
transfer of w toward e should be less than 1 for any A(s) of type [15.52] such that 
Aw], < 1. 


Let T(s) be the transfer between (v w)! and (z oT of the closed loop system 


through K(s) (Figure 15.11). Based on the results in section 15.2.2, this property is 
verified if and only if: 


VER, ug (T(ja)) <1 [15.54] 
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where the set S' is defined as in [15.31]. Establishing a corrector verifying [15.54] 
is called a synthesis problem [ZHO 96]. Unfortunately, except for simple cases, it 
has no solution known nowadays. Therefore, we have to try to solve it by using an 
alternative method. 


15.3.2. Approach by D-K iterations 


However, based on the results in section 15.2.4, an inequality of the following 
type is verified for each value of œ: 


Us (T(jo)) < 77, with [15.55a] 

pie nur Ya under the constraints: [15.55b] 

Yo 20 [15.55c] 

Tjo)? D,T(jo) - Yo Doy < 0 [15.55d] 

and DY = | poe es ) | [15.55e] 
i ; di 


The set D' is formed of matrices which can be substituted with any matrix of 
S'. A more realistic problem consists of searching a corrector K (s) and a stable 
transfer matrix D(s) so that its reverse D(s) | is stable and can be substituted with 
any matrix of S', such that: 


|2% T(s) Dis)" <1 [15.56] 
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In fact, condition [15.56] will ensure that: 





Vo uş (T(j@)) < Yo = ALL JAD Te joy! D? Tia) Dy") 





< JA(Djo)y! T(-ja)" DGO TGO) DGO!) 


< | D(s) T(s) D(s) ! |. <1 


Let us note that the constraint of having D(s) and D(s)™' as stable is necessary 
in order to have a corrector which stabilizes the transfer matrix which appears in 
[15.56]. 


Once more, this problem has no generally known solution, but we can try to 
solve it by searching alternative matrices K(s) and D(s). In fact: 


— calculate fixed K(s) to D(s) is nothing else than a problem of synthesis H., 
corresponding to the block diagram in Figure 15.12; 


— with fixed K(s), the search for D(s) can be conducted by calculating the upper 
bound [15.55] for a set of previously chosen values of œ and then by interpolating 
the matrices Dp obtained by a stable and inversely stable transfer matrix. 


These two steps are repeated until the convergences of matrices D,, or the 
fulfillment of condition [15.54]. This procedure is named D-K iteration. 











Figure 15.12. 1, standard problem solved during D-K iterations 


We note that if the calculation of K(s) on the one hand and the calculation of 
each matrix D, on the other hand are perfectly solved problems, the procedure 
convergence is ensured only if D(s) perfectly interpolates the matrices D,,. In 
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practice, we are limited to a rough interpolation, meaning that we lose any guarantee 
of the convergence of the D-K iteration. 


We should also note that the H. synthesis which provides K(s) uses an 
interconnection matrix Pp(s) which contains the matrices D(s) and D(s)' (Figure 
15.12): the corrector order obtained by the Riccati equations (section 15.1.4) or by 
LMI (section 15.1.5) will then be equal to the Pp(s) order, which increases along 
with the order chosen for D(s) during the interpolation of matrices Do. 


This procedure is obviously heavier insofar as the number of uncertainties taken 
into account in A(s) is significant. An intelligent use of this technique consists of 
using it with a limited number of uncertainties and taking care to choose those which 
are the most penalizing for the synthesis. After that, we shall perform a much more 
refined analysis of robustness, the U-analysis procedure developed within section 
15.2 not presenting the same inconvenient. 


15.3.3. Example 


Let us recall the example presented in section 15.2.3 in order to illustrate the u- 
analysis. We had established that having a constant corrector K(s) = 2, the 
robustness analysis tracked by using the diagram in Figure 15.9, made it possible to 
guarantee an H. norm less than 0.89 for any value of the parameters verifying 
inequalities [15.48]. 


We want to enhance this result by calculating the corrector K(s) by u-synthesis. 
For that, we will apply the D-K iterations procedure by identifying the transfer 7(s) 
which used to appear during the development of the previous section at the transfer 
H(s) in Figure 15.9. The matrices De which intervene in the synthesis procedure 
will be contained in the following set D’: 


D'={D= diag {d,,d>,d3,1} ; de C} [15.57] 


Let us note that we choose the diagonal matrices to limit the number of transfers 
to determine at the moment of interpolation (whereas the presence of a repeated 
uncertainty Ô J, would authorize to replace the first two elements with a plain 
matrix 2 x 2). Additionally, the procedure does not make a difference between real 
and unreal uncertainties. These two remarks show that the family of the patterns to 
be considered forms an over-set of those in which we are directly interested. 


The D-K iterations have been conducted by choosing to interpolate each element 
of the matrices D,, by a first order transfer function and by using the software 
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[BAL 93]. The maximum of the ws: (H ( jo)) upper bound, calculated in 
accordance with the selection performed in [15.57] by considering four complex 
uncertainties, is established at the end of four iterations on an order value of 0.72. 


By recalculating Us (H ( j@)) as in section 15.2.3 (i.e. with one real repeated 
uncertainty and two complex uncertainties), we obtain a hardly different value, 
namely 0.697 (Figure 15.13.a). We infer from this that the closed loop control 
system preserves an H. norm less than 0.697 for any value of the parameters 
verifying the inequalities: 


0.57 <a < 3.43 
[15.58] 
O<T<0.72 
The matrix D(s) obtained after the last iteration is given by: 
Day= ddk 0.685 s +1.038 ; 19.79 s +36.37 : 30.185 +8.36 4 [15.59] 
s+0.860 s+40.72 s +41.68 


The obtained corrector is of 9" order (equal to the sum of orders of G(s), Ws), 
D(s) and D(s)'), but it can be easily reduced to a 2" order transfer function whose 
expression is: 


1+5/1.062 


K(s)=1.14 
(5) Ü+5/68.13)1+5/100) 


[15.60] 





The Bode diagram for this corrector is given in Figure 15.13b. 


15.4. Synthesis of a corrector depending on varying parameters 
15.4.1. Problem considered and L, gain 


The use of a fixed corrector, even “robust”, is not conceivable for a process 
whose parameters vary strongly or rapidly. In this section, we shall suppose that the 
process to be controlled can be described by a linear system with variable 
parameters (LVP system), whose general form is the following. 
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b) Bode diagram of the corrector 


Figure 15.13. Results of 4-synthesis 


x(t) AQ) | ByO) By) | ( x 
P(A): | e@) |=] Co) | Dol) Deu(9) | | WO 
70) (C0) | Dyl) Dy (A) ) (UO 


[15.61] 





where Ot) = (6,(t), OO, …, 6, he is a vector with time depending parameters, 
each component of O(t) having the possibility to be measured in real-time. This last 


Robust H,/LMI Control 509 


hypothesis enables us to search for a corrector K(@) in the same form (Figure 
15.14). 





Figure 15.14. System and corrector depending on parameters 


It is suitable first of all to define a measure for the performances to be reached. 
For this, we can generalize the concept of H. norm (which is applicable only to 
invariant systems) in the following way; let L,(R”) be the set of signals s(t) with 
R, in R” whose L, norm: 


Is], = 45 ss dt [15.62] 


is limited. The L, gain of the system in Figure 15.14 is then defined by: 


lel 


wl 





y= [15.63] 


w(t)e L, 








The synthesis problem we are going to consider is to establish a corrector K(0) 
depending on the parameters which ensure the stability of the closed loop control 
system in Figure 15.14 for any possible evolution of O(t), while providing to the 
closed loop control system a L, gain between w and e less than a given value y. 
Further on, we consider two versions of this problem corresponding to two pattern 
establishments different from the process and from the corrector. 


15.4.2. Polytopic approach 
Let us suppose that each parameter 0; (t) could have any value within a range of 


type [9; ; 8i]. The vector O(t) can then have any value within a section of R? . 
We shall note by 7; , i =1,---,2” the peaks of this section. If equations [15.61] of 
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the system to be controlled are connected in O(t), each matrix of its state 
representation evolves within a “polytope” whose peaks are successively obtained 
by considering O(t) = 2; ,i=1,---,2”. More specifically, at any moment f, the 
vector O(t) can be expressed as a barycenter of the peaks 7;: 


2? 2? 
i=l i=l 


and the state representation matrices [15.61] are expressed based on the same 
coefficients œ; : 


AO) | BaO) BaO) y | AG) | Bw(ti) Bult) 
C.(9) ae Deu) |= Eai | Celti) | Dei) Deu (Ti) | [15.65] 
Cy (0) Dyy() Dy, (9) g Cy 0) Dyy(%;) Dy, (ri) 








It then seems possible to search for a corrector: 


Ko: (2) [4x O | Be) [O eee 
(uO J (KODA) (ro | 





whose state representation matrices are expressed once more based on the same 
linear combination: 





(ae o) E Bi | a [15.67] 
i=l 


Cx (0) | Dx (6) Cx (a;) | Dg (7) 


Further on, we shall suppose that the following hypotheses are verified: 
HS) D,,,(@) = 0 or in an equivalent manner D,,,(7;) = 0 for i =1,---, 2P; 


H6) B,(8), C (0), De,(@) and D,,,(@) are independent of @, or in an 
equivalent manner: 


B,(4;) = By Cyi(4) =C, Dey (Ti) = Deu > Dpwh) = Dyy st =1,..,2P 
[15.68] 
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H7) for any possible evolution of O(t) within the top peaks z;, (4(8), B,,) can 
be quadratically stabilized and (C y» A(0)) can be quadratically detected, or in an 
equivalent manner there are real symmetric matrices X > 0 and Y > 0 satisfying 
respectively the followings LMIs: 


N,’ (ATX+XANN, <0,i=1,..,2P [15.69a] 
Ny’ (4Y +YA; Ny <0,i=1:.,2P [15.69b] 


where N, and N, form a base of the cores of B; and C, respectively. 


The feasibility of the problem presented is tested by using the following theorem 
[APK 95a]. 


THEOREM 15.5.— having the hypotheses H5, H6, H7, there is a corrector 
stabilizing the system in Figure 15.14 and ensuring an L, gain less than y for any 
possible evolution of @(t) within the top peaks ī;, if and only if there are two 
symmetric matrices R and S verifying the following three matrix inequalities: 


à r| ATR + RA) RC) BG) | o 
| i I | Ce(r;)R = Ya, Dew(Zj) | I | < 0 
ny Ny 
By) Dey (i) E Yn, 
i=1,-+-,2? [15.70a] 
T| AG) S + SAT) SBy(a;) Cola)" 
Ns T T Ns 
By (7) S z VIn Dew(T;) <0 
0 J, w 0 1, 
‘ C.(7;) Dey (Aj) E Ya, j 
i=],...,2P [15.70b] 


R I 
| à > 0 [15.70c] 
I, S 
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where Nr and Ng form a base of cores of (BT Deu) and (C, Dyw) 
respectively. E 


Inequalities [15.70], calculated on inequalities [15.20.a, b,c] of the H% 
synthesis problem are LMIs in R and S. We note that, unlike the H. problem, the 
first two inequalities are both replaced by a system of 2” inequalities which must be 
simultaneously verified by the same matrix R or S. 


After the resolution of this inequality system, the construction of matrices 
Ax (Ti), Bk (ti), Cx (Ti), DKx(x;) can be done, for every peak 7;, by following 
the approach presented in section 15.1.5 for the construction of the H. corrector. 
The final corrector is hence obtained by the formula [15.67]. It is actually dependant 
on the evolution of A(t) because, at the moment J, we have to infer from O(t) the 
values of the coefficients @;, from where we infer the corrector matrices. We 
understand that this operation can become tedious if the number of variable 
parameters taken into account within the synthesis is increased. 


15.4.3. A more general approach 


Let us take again as departing point equations [15.61]. When the different 
matrices of this state representation depend on O(t) in a rational manner, this 
system can be presented like in Figure 15.15, where an invariable system described 
by the matrix P(s) is looped by a matrix © (t) isolating the parameters: 


O0) = diag (O Un,» (On, Op On, ) [15.71] 


This operation is similar to the one necessary in order to perform a u-analysis. 
We can search for this system a corrector in the same form, i.e. presented as the 
looping of an invariant system of transfer matrix K(s) and of a response of matrix 
O(t) (Figure 15.15). 
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Figure 15.15. Structures of the system and of the corrector 


Without being any less general, we can always suppose that the elements 
contained into @(f) have been standardized, so that: 


|6,)|<1 Vi=1..,p [15.72] 
Further on, we shall note the state representation of P(s) in the form: 


x(t) A | Bo Bw Bu \( x) 
t Ca | D D D t 

RA N | a |e [15.73] 
e(t) Ce | Deg Dew Deu w(t) 


y(t) Cy De Dy Dyu u(t) 





and we shall suppose as verified the following hypotheses: 
H8) Dy, =0; 
H9) (A, B, ) can be stabilized and (C,, A) can be detected. 
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As in the case of p-analysis, we define a set of matrices calculated on the 
structure of the matrix @ (t): 


Ly ={L = diag {Lss Lp }; Le RW; L=U >of [15.74] 


The feasibility of the problem presented is tested using the following theorem 
[APK 95b]. 


THEOREM 15.6.— having the hypotheses H8, H9 there is an corrector stabilizing 
the system in Figure 15.15 and ensuring an L, gain less than y for any possible 
evolution of O(t) verifying [15.72] if and only if there are two pairs of existing 
symmetric matrices (R,S)ER" xR" and (L,J)eLyxLy, verifying the 
following three matrix inequalities: 


RA LAR RCo RCS? HET B, 


Ne 0 T Co R -J 0 Deo J De, Np 0 
Co R 0 -yI Də J Dey <0 
0 I à 7 z 0 I 
J Bo J Dog J Deg aJ 0 
Br DT -D,a 0 
w Ow ew -yI 
[15.75a] 
SALATS SBọ SB, CọL Cf 
T E T T 
Ny 0 T Hose L 0 Do d Peg Ny 0 
0 7 By S 0-71 Dw L Dew || 9 |<? 
LC L Dog LDy -L 0 
Ce Deo Dew 0 —yI 
[15.75b] 
R I 
>0 [15.75c] 
I S 
L I 
>0 [15.75d] 
IJ 


where Nr and Ns form a base of cores of (B,7 Da Dex’ ) and 
(C, Dyo Dyw) respectively. 
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Based on matrices R, S, L and J, a corrector responding to the problem can 
be built following an approach similar to that presented in section 15.1.5 for the H 
corrector [APK 95b]. 


This second approach covers a range of systems wider than that of the polytopic 
systems because the dependence in @(f) is supposed to be rational (and not 
necessarily connected). Moreover, it does not require expressing @(f) at each 
moment as the baric center of different peaks of its evolution field. On the other 
hand, it requires putting the system in the form of Figure 15.15. The resolution uses 
two additional matrices, but the number of LMIs to be verified is smaller. 


15.4.4. Example 


We consider a system described by the following state equations: 


Or AE ee. i 
O6 -s0015 Ot 4206/0015)” .0.1<8<2 [15.76] 
z() =(1 Ox 


This system corresponds to an integrator followed by a first order transfer whose 
time constant varies between 7.5 and 150 ms. 


Taking into account the range of values of ô given below, we are going to 
search for a corrector which minimizes the L) gain of the system described in the 
block diagram in Figure 15.16. On this appear two exogenous inputs w,, w which 
act, one in the system’s output (and which represents, for example, a setting input) 
and the other at the input and two outputs e}, e> which correspond to the feedback 
error and to the control (this type of problem is rather usual within H. synthesis 
[DUC 99]). 








ee 


| Corrector | | Corrector |< 








Figure 15.16. Synthesis problem considered 
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The system being polytopic, we can apply the approach of section 15.4.2, by 
defining the peaks 7, and 7, from two extreme values of 6 . Certainly, it does not 
verify hypotheses [15.68] but we can modify the problem by adding at the system 
input a filter of negligible constant time T'ẹ : 


0 4/7 0 0 
X(t)=|0 -6/0.015 4206/0.015 |x)+| 0 O 
0 0 UT, 1/T, pe 


z(t)=(1 0 0)x(t) 


Further on, we have considered T, =0.1 ms. The minimization of y under 
constraints [15.70], based on the software [GAH 95], leads to the value y =9.37. 
The correctors corresponding to each peak are given by: 


3.34107 -3.37107 1.8410° | —2.2510° 
Ax, | Bx, -~6.0610° -3.46104 -3.3710 | -37.8 
PRO -2.1710 -2.4410 | 2.34105 





— 0.600 -2.41 — 32.8 


3.9310? —6.6510° 9.5510° |—2.2510° 
Ge vi —5.7410* -3.4710* -3.1810 | -37.8 


[15.78] 


~ | -8.0810* 1.96105 —2.8710° | 2.34105 
— 0.600 -2.41 -32.8 0 








Cry | Dr 





The parameter 6 is expressed according to the peaks 0.1 and 2 by the 
expression: 


5=0.la+2(l-a) & a=— [15.79] 


the parameterized corrector in ô has the following state representation: 


Ax | Bx 20 Ax, | Bry 4 01 Aka | Bro [15.80] 
D) 19 (Cx | Dr,) 19 (xy | De, 
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Figure 15.17a shows the response of the output z(t) and of the control v(t) to 
an amplitude interval —1 on the input w, the parameter ô evolving throughout this 
response in accordance with the relation d(t) =1.05+0.95 cos(50f). We note that, 
despite a very brutal variation of ô during this transient, the response is very well 
damped and the control is very smooth. 


Figure 15.17b shows the output in response to the same input signal, this time for 
constant values of 5, which are contained between 0.1 and 2. We note that the 
various responses are relatively homogenous despite this strong parametrical 
dispersion. A synthesis of a fixed corrector, even with all issues identical, does not 
make it possible to obtain this result. 


Amplitude 
1.25 
14 
0.8- 










Output 


Control 
aa 











0 0.05 0.1 0.15 0.2 
Time (sec) 


a) Response with parametric variation 
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1.24 

1 
0.8 5 
0.6 4 
0.4 
0.2 - 

















-0.2 T T 
0 0.05 0.1 0.15 0.2 
Time (sec) 





b) Fixed parameter response 


Figure 15.17. Unit-step responses with LVP corrector 
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15.5. Conclusion 


In this chapter, we noted that the concepts inferred from the H. norm connected 
with the matrix inequalities make it possible to implement very strong analysis and 
synthesis tools. The developments presented here have many extensions within the 
contemporary works. See [DUC 98, DUC 99] for a more complete bibliography on 
this subject. 
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Chapter 16 


Linear Time-Variant Systems 


The complexity of the physical phenomena studied cannot be reduced to only 
one modeling by linear dynamic systems with constant coefficients. These models 
are sometimes poorly adapted because, for example, they can only deal with 
magnitudes having an exponentially decreasing correlation. However, in fields full 
of variety such as hydrology [HUR 65], electronics [VAN 88], traffic [RIE 97, WIL 
95], electrical engineering [CHA 81] and mechanics [CLE 98, ZHU 96], there are 
many situations that generate behaviors that do not obey these quite simple models. 
Therefore, in the last 30 years, new analysis models and tools have appeared. In this 
perspective, one of the goals of research is to extend the class of linear dynamic 
systems by including those for which the coefficients vary in time. These variations 
can be divided into two classes. The first class concerns the sudden non- 
stationarities or “failures” characterized by time intervals where the coefficients are 
constant. The non-stationarity is due only to the presence of instantaneous shifts in 
their values. This modeling is found, for example, in the field of monitoring and 
diagnostic [BAS 93]. As such, the problem is to essentially detect the instants of 
change as well as the amplitude of the parametric shifts. The second class pertains to 
the systems where the coefficients are functions of time. When these dynamics are 
“slow” with respect to those of the system, they can be dealt with through adaptive 
techniques. 


However, there are also many cases in which the evolution of parameters is 


“fast” (T-periodic systems [RAB 92], auto-similar systems [GUG 01], etc.). This 
last category makes the development and the implementation of specific methods in 
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both the control and identification fields indispensable. Firstly, it is important to 
have the basic mathematical tools indispensable to their analysis. 


This chapter is dedicated to the analysis of the dynamic systems required by the 
linear differential equations with time-variant coefficients. The approach presented 
consists of an approach parallel to that adopted for the constant coefficient systems. 
The Laplace transform, even if it can be always applied to the input/output 
magnitudes of the system, can no longer be used in order to define the transfer 
function of these systems. 


However, this transfer concept can, despite everything, be extended to the non- 
stationary linear differential systems provided they operate on the non-commutative 
body of rational fractions. This body is isomorphic to the group generated by the 
non-stationary linear dynamic systems. Hence, we can elaborate the composition 
rules of these systems with the help of the algebraic rules applied to the transfer 
functions. The results obtained can be used in order to solve control or/and 
identification problems. 


This chapter deals, in the first place, with the construction of the non- 
commutative polynomial ring and of the body of related rational fractions. As in the 
traditional case, the relation between the basic properties of dynamic systems 
(stability, etc.) and the characteristics of the elements of the body of fractions (poles, 
etc.) can be established. The second part pertains to the construction of the systems: 
serialization or/and parallelization. It is possible, for each association diagram, to 
write the transfer function of the system composed with the help of simple algebraic 
rules. Finally, based on these results, two applications illustrate the use of the results 
obtained. The first one concerns the modeling of multi-component polynomial phase 
signals and the second is dedicated to the design of a pole placement control law. 


16.1. Ring of non-commutative polynomials 


Let TI(à) be the set of polynomials of degree n: 
TIA) = {PO = a, ON + a, A"! +... + aola AEK Vie [Ln] } 


Let K be a differential body, i.e. a body on which is defined a derivation operator 
da/dt\/a(t)€ K (noted from now by @ ) which will satisfy the traditional derivation 
properties: 


(a+b)=à+b and ab=ab+ab 
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when coefficient a„(t) is equal to 1, polynomial P(A) is standardized. 


The set II(A) including addition and multiplication which satisfies: 


Va(t)€ K, Aa(t) = alta + a(t) 


has a non-commutative ring structure [ORE 33]. 


16.1.1. Division and the right highest divisor (RHD) 


V BA), P(A)ETI(A) © MA) 


insofar as np 2np, (where ny represents the degree of X(A)) there is a unique 


pair of polynomials [Q(A), R(A)] so that: 
P(A) = Q(A) Px (A) + RA) with ng < Np 
We can infer Euclid’s division algorithm: 
P(A) = QA) P(A) + PQ) 
Ph (A) = On-1 (A) Pn—2 (A) + Pa (A) 


The RHD of PA), P(A) is then defined as the standardized polynomial 
resulted from P, (A). 


16.1.2. Right least common multiple (RLCM) 


It is then possible to define the RLCM of P(A), P(A) as the lowest degree 
standardized polynomial divisible on the right by both PA (à) and P(A). 


MO) = QA) P(A) = QA) PA) 
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Generally, the existence of the Euclidian division implies the existence of the 
RHD [ORE 33]. 
16.1.3. Explicit formulation of RLCM 


Based on all the factors of the Euclidian division, it is possible to express the 
RLCM: 


MO) = AP, AUP, AT P,2 Mar. [AAI Ar AAI 


The constant o(t) (which does not depend on À) is such that polynomial MA) 
is normalized. 


NOTE. the product of two polynomials A(4)P,(À) cannot generally be divided on 
the right by P,(A) which makes the use of all the terms of the Euclidian division 


compulsory in the expression above. 


M(A) is a polynomial and writing it in the inverse form is basically a useful 
notation. 


By applying the same approach, we can define the left highest divisor and the 
left least common multiple. 
16.1.4. Factoring, roots, relations with the coefficients 


Any polynomial P(A) can be factorized in the general form: 
P(A) = AOA = py(t))(A = pa)... À = Pr-O) = pat) 


The roots of P(A) are provided by the solutions of equation P(A) = 0. 


The relation between the roots and the coefficients of P(A) is a non-linear 
differential equation [KAM 88]: 


nal 
8" pa) + D a OS | py) +a, OST p, (0) + ag) = 0 
2 
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where S is the operator defined by: 


SPa (© = PZO + Pn) 
Sip O= p (OS +S P vi > 2 


EXAMPLE 16.1.— let us consider the second degree polynomial: 
PO) =} + a (A+ alt) = A- PDA- p20) 
From the following relations: 
Pi) + p2(0) =—ay(1) and PiP- Pa) = ag) 


we infer that the roots are solutions of: 


p3(t)+ py(t) + a Opt) + ag(t) = 0 





Particular case: a(t) = ap()=0 > PA)= xX =0 








> Pa2(0) = ; ; VkeR 


1 
=> p(t) = t)=0 and t=- 
P216) = p2 (6) and p(t) ; E 


+k 


Hence, the factoring of P(A): 


PA) = À spas À VkER . 
t+k t+k 


16.2. Body of rational fractions 


In general it is not possible to define a body of rational functions from 
polynomials for which the unknown factor and the parameters cannot be switched 
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(skew). However, if the polynomials verify the two following conditions called ORE 
[AMI 54]: 


Y BA), (Ae TIA) 817(2) 


à E i R (condition on the left) 
3 ÅM, ÊA) so that À (22 (2) = ÅWPA) 


and: 


Y P(),PB(@)e TA) ® TWA) 


3 3 z z (condition on the right) 
1 P(A), P(A) sothat P (AP (A) = P (A)P, (À) 


it is possible to consider the set: 


F(A)= [Paga .VP(U)EIT (À), VO(A)ETT(A) }= 
{or PA) VP(Del(), VOU)eET (A) } 


where IT’ (A) = II(4)-{0} which has a body structure [ZHU 891. 


NOTE.- the inverse of polynomial P(A) is unique and verifies: 


PAPELA) = PP (A) = 1 


16.3. Transfer function 


In the case of linear systems with constant coefficients, the transfer function is 
defined as the ratio between the Laplace transforms of the output and those of the 
input. When the coefficients are time functions, it is possible to extend this concept 
of transfer function which preserves certain properties obtained in the traditional 
case even if it does not represent any longer the ratio between the Laplace 
transforms of the pair input/output. 


Linear Time-Variant Systems 527 


Let De be the set of n degree single-variable systems described by the 
t 


linear differential equation with variable coefficients belonging to a derivable body 
K: 


y tay) O +a, (Dy (1) = bou P (1) +b Qu" YH) +---+b, (QUO) 


with y(n) =d'y(n)/ar, WW) = dufat 


It is easy to show that this set, consisting of two internal operations (serialization 
(mas : ee d f 
and parallelization) is a body. Hence, the application of Bei on F(A) is an 
t 


isomorphism. 


Therefore, system Z can be formally described by its transfer function: 
H(A) = QA)" PQ) 
where: 


O(A)=A" +a, (02 ++ a, (0 , P(A) =b” +b (A! +--+ b, (0) 


16.3.1. Properties of transfer functions 


THEOREM 16.1— two transfer functions H\(A) and H(A) are equivalent if and 
only if there is a polynomial DA) such that [KAM 88]: 


P(A) = DAPA) and QA) = D(DQ, (2) 


The transfer function H(A) is said to be minimal if the numerator and 
denominator are first on the right. 





d 
1 Henceforth, 20) will be simply noted by È. 
t 
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16.3.2. Normal modes 


If we make the homogenous equation: 
YO Ota, Oy ++ a, OVO =0 


correspond to the polynomial equation Qà) =0, we show that if q„(t) is a root of 


KAO 
QÀ), then e° is a normal mode of the system. 


EXAMPLE 16.2.- for yr) =0 which corresponds to: 


G4 PAY VkeR=0. 


[+d 
A root vA SE provides the normal mode: e° =t+k (well known). 


16.3.3. Stability 


The solutions of the homogenous equation form a vector space of size smaller or 
equal to its n degree [AMI 54, ZHU 89]. 


The general solution is written: 


fa (t)dt 
y(t) = Vice’ 
i 


We infer that the system is stable if: 


limA(g())<0 Vi 


too 


where À means real part. 
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16.4. Algebra of non-stationary linear systems 

A major interest in the transfer function is due to the possibility of easily 
calculating the transfer function of associated systems, either serially or in parallel. 
16.4.1. Serial systems 

Let >, and >; be two systems of transfer functions H; (À) = Q) A PRO and 


HO) =Q> A) PB (A) respectively; the transfer function of the system >) obtained 
by serializing >, and >> can be calculated as follows: 


Let M(A) be the RLCM of P,(A) and of QÀ): 


M(A) = ÈMA) = Qy (A) QA) 
then we have: 


H(A) = KWH (A) = 05! CP Or (APA) 
= 0 DB DB PQ P(A) 

= 0 DB MA PA) 

= 0 DB ADO (AQ) (A)! P(A) 


and finally: 
H(A) =[Py(AQ AT O(ARA) 
EXAMPLE 16.3. 


If EROAA and Ezi- A= nA 
H(A) =O '(ARA)zAtFY 7/4 MA) =G ARA = A-A! 
we obtain: M(A) = Py (A)Py(A) = Q1(A)Q,(A) =)+1/t and H(A)= a7! 


so È y(t)=u(t) 
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16.4.2. Parallel systems 


Let >, and >; be two systems of transfer functions H; (À) = Q (A) TB (A) and 
H(A) =Q> A) P, (A) respectively; the transfer function of the system È obtained 
by putting >, and X» in parallel is provided by: 





Let M(A) be the RLCM of Q)(A) and QÀ): 
MA) = À Qi A= On (AQ À) 


then we simply have: 


H(A) =H (A) + H(A) = OF (APA) + OF DB; À) 
H(A) = MT (A)(Q, A) (A) + Op (A) PQ) 


EXAMPLE 16.4. 

Zi wWOt+Vty,Q=uMand XY, yo()-1/tyy =u) 

Hy (A) = OF PO) = (+t)! Ha (A) = 07 APA) = AID 
gives: M(A) =A(A +1/t)=(A42/)A-1/t) = 2 +1/tA+2/t 


and H(A) =(A2 +1/t4 + 2/t) 12. +1/t) 


1 2 1 
or: Z y+ po + pO = 2(u(t) + 24) 
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16.5. Applications 


In this section, two types of usage of this algebra are presented in the field of 
modeling and control. 


16.5.1. Modeling 


One of the methods of signal and control processing consists of designing 
models capable of representing the magnitudes in question. Hence, models MA, AR, 
ARMA with constant coefficients have been very successful due to their capability 
to characterize a sufficiently large variety of dynamic behaviors. Their properties 
were the object of numerous studies and their applications are extremely diversified. 
However, these models are sometimes insufficient. A current approach is, similarly 
to the one that facilitated the traditional models MA, AR and ARMA, the research 
for new models taking into account the highly non-stationary character. 


We can illustrate this requirement for the modeling of polynomial phase signals, 
which we frequently encounter in physics, especially for processing signals coming 
from radar, sonar, etc. These signals are non-stationary with frequency 
characteristics that continuously evolve in time with variation speeds that may be 
significant. The dynamic model of the single-component signal is very easy to 
establish, whereas that of the multi-component signal is very complex. However, 
this is indispensable when we deal, for example, with the problem of multiple 
trajectories due to reflections. The algebra developed here makes it possible to 
create the complete dynamic model. 


Let us consider the following multi-component signal: 


n 


i + OL 42 
N= Xe D 
i=l 


i=l 


It corresponds to a signal (“chirp”) recorded from a main trajectory to which are 
associated n — 1 reflections. 


For each component y,(t), it is easy to obtain the differential equation that gives 
it: 


3,1) + (at+B,)y,()=0 viela] 





M a 
ïA- or +B, 
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However, obtaining the one that governs the sum y(f) requires the use of the 
results presented in the previous section. 


Formally, from: 


2 a 


er 





A+(a+BY ly,O=(BWIy,0=0 Vie [Ln] 


we obtain for y(t): 


[M(A)] y(t) =0 with M(A) = P.P.C.M.D.(P,(A)) 


16.5.2. Pole placement 
Let us consider the system (2) written by its transfer function 


H(A) =A !(A)B(A), is there a looping defined by O-'(A)P(A) so that the closed 
loop transfer has a dynamics set a priori by a polynomial C(A)? 






From A(A)y= B(A)u, O(A)r = P(A)y and u = -r + y, we obtain: 


A(A) y = B(A)u 
Q(A)u=—P(A) y+ OA) ye 


The solution to this problem goes through an intermediary problem which 
consists of seeking the controller in a factorized form: 


PAO | (A) 
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which is connected to the initial factorization by: 


O(A)P(A) = PAO) 
which leads to: 
(ADO) + BYP) y = ANDA ye 
In order for the closed loop poles to be given by C(A), it is necessary that: 
CA) = ANDOA) + BAPA) 


with: P(A)O(A) = OMPA) 


Which leads to the following algorithm: 
— solution of the Diophantine equation; 
— search of RLCM MA) of O(A), P(A); 


—P(r) and Q(A) are quotient polynomials of M(A) by OA) and P(A) 
respectively. 


The resolution of the Diophantine equation is done the same way as in the case 
where polynomials can be switched [KUC 79]. 


EXAMPLE 16.5.— let us consider the system Z described by the differential 
equation: 


yttyt+y=utu 
We choose CA) = (À +1)° 
The previous algorithm applied to: 


(07 + + DOA)t (AF DPA) = (À +1 
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leads to: 


t —-3+3 1 
(1-1) (=i)? 





ONE + and PO) = 


And the condition: 


QA)P(A) = PAOA) 








finally gives: 

4 3 2 
t -4t +61 -21+5 

DORE ane 
At) —3t° +31+1) 
2. 342 

paye #43 Gee =6 panna) 

t —3t° +3+1 t —3t° +3t+1 


which gives the following controller: 


tt -4P +6 -2t+5 
u ene) = 
(A-E -3t +31+1) 





Fees 
P -3t +31+1 





P -6r +15t+14 | 


(1? 3t+3)y+ 3 5 
t —3t° +3t+1 


16.6. Conclusion 


Due to the use of the algebra defined on the non-commutative body of rational 
fractions, it was shown that, not only could the concept of transfer function of a 
linear dynamic system with time variable coefficients be extended, but also the 
traditional operations on these systems had simple solutions, based on simple 
algebraic operations defined on the body of the related fractions. These 
mathematical results make it possible to raise and solve traditional control or/and 
identification problems. Obviously, the complexity of calculations is increased with 
respect to traditional systems (i.e. with constant coefficients) and, in practice, it is 
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necessary to use formal calculation tools. Finally, the approach presented here was 
to voluntarily consider the continuous-time systems but an analogous approach can 
be followed for the discrete-time systems (see, for example, Kamen’s works). 
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